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ABSTRACT 
We measure the luminosity and color dependence of galaxy clustering in the largest-ever galaxy redshift 
survey, the main galaxy sample of the Sloan Digital Sky Survey (SDSS) Seventh Data Release (DR7). 
We focus on the projected correlation function w p (r p ) of volume-limited samples, extracted from the 
parent sample of ~ 700,000 galaxies over 8000 deg 2 , extending up to redshift of 0.25. We interpret 
our measurements using halo occupation distribution (HOD) modeling assuming a ACDM cosmol- 
ogy (inflationary cold dark matter with a cosmological constant). The amplitude of w p (r p ) grows 
slowly with luminosity for L < L* and increases sharply at higher luminosities, with a large-scale 
bias factor b(> L) x (er 8 /0.8) = 1.06 + 0.21(L/L») 112 , where L is the sample luminosity threshold. 
At fixed luminosity, redder galaxies exhibit a higher amplitude and steeper correlation function, a 
steady trend that runs through the "blue cloud" and "green valley" and continues across the "red 
sequence." The cross-correlation of red and blue galaxies is close to the geometric mean of their auto- 
correlations, dropping slightly below at r p < 1 h^ 1 Mpc. The luminosity trends for the red and blue 
galaxy populations separately are strikingly different. Blue galaxies show a slow but steady increase of 
clustering strength with luminosity, with nearly constant shape of w p (r p ). The large-scale clustering 
of red galaxies shows little luminosity dependence until a sharp increase at L > 4L* , but the lowest 
luminosity red galaxies (0.04 — 0.25L*) show very strong clustering on small scales (r p <2h~ l Mpc). 
Most of the observed trends can be naturally understood within the ACDM+HOD framework. The 
growth of w p (r p ) for higher luminosity galaxies reflects an overall shift in the mass scale of their host 
dark matter halos, in particular an increase in the minimum host halo mass M m ; n . The mass at 
which a halo has, on average, one satellite galaxy brighter than L is A/i « 17M m ; n (L) over most of 
the luminosity range, with a smaller ratio above L*. The growth and steepening of w p {r p ) for redder 
galaxies reflects the increasing fraction of galaxies that are satellite systems in high mass halos instead 
of central systems in low mass halos, a trend that is especially marked at low luminosities. Our exten- 
sive measurements, provided in tabular form, will allow detailed tests of theoretical models of galaxy 
formation, a firm grounding of semi-empirical models of the galaxy population, and new constraints 
on cosmological parameters from combining real-space galaxy clustering with mass-sensitive statistics 
such as redshift-space distortions, cluster mass-to-light ratios, and galaxy-galaxy lensing. 

Subject headings: cosmology: observations — cosmology: theory — galaxies: distances and redshifts 
— galaxies: halos — galaxies: statistics — large-scale structure of universe 
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1. INTRODUCTION 

Three-dimensional maps of the large-scale distri- 
bution of galaxies reveal a rich network of filaments 
and sheets, punctuated by dense clusters and in- 
terleaved with low d ensity tunnels and b ubbles 
dGregorv k Thompson) 119781: iKirshner et all 119811: 

ncarini 



Davis et all Il982t 



Giovanelli. Havnes. k Ch 



19861: [Geiler k Huchral 119891: iShectman et all 1 199ft 



Colless et all 1200111 Different classes of galaxies trace 
this structure differently, with early type galaxies 
residing preferentially in rich groups and clusters and 
late type galaxies residing preferentially in the filaments 
and walls; this segregation of clustering was already 
evident in two-dimensional studies as early as iHubbld 
(1936). Galaxy surveys map the distribution of visi- 
ble baryons, but a combination of observational and 
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theor etical arguments, beginning with Zwicky (|1933l 
119371 ). show that the galaxies trace an underlying 
network of invisible, gravitationally dominant dark 
matter. In this paper, we measure the clustering of 
galaxies as a function of luminosity and color in the 
largest galaxy redshift sury ey to date, the main galaxy 
sample (|Strauss et alll20 02) of the Seventh Data Release 
(DR7; lAbazaiian et all 120091) of the Sloan Digital Sky 
Survey (SDSS; |York_et_aL| |2000) . Our primary tool is 
the two-point correlation function £(r), which provides 
a simple, robus t, and inform ative measure of galaxy 
clustering (e.g., iPeebles! Il980t ). More specifically, we 
focus on the projected correlation function w p (r p ), 
which integrates out redshi ft-space distortions cau sed by 
galaxy peculiar velocities ([Davis fc Peebles! [1983). By 
modeling our measurements in the context of the ACDM 
cosmological framework (inflationary cold dark matter 
with a cosmological constant), we infer the relation 
between different classes of galaxies and the underlying 
distribution of dark matter, providing fundamental tests 
for theories of galaxy formation. 

Over the last few decades, a variety of "local" 
clustering studies have established an increasingly re- 
fined and quantitative characterization of the depen- 
dence of galaxy clustering on lumino sity, morphol- 
ogy, color, and s pectr al type (e . g., [Davis fc Gelled 
19761 : iDavis et all 119881: [Hamilton Tl988t lAlirni et all 



J? 



1988; Valls-Gaba ud et all 119891: lLovedav et all 1995 



Benoist et all 119961: iGuzzo et all Tl997t IWillmer et al 



1998; 
2001, 



iBrown. Webster fc Boyle 



2000; 



Norberg et al. 



2002; 



2002; Budavari et al. 



Zeh avi et all 
Madewick et alll2003|:IZehavi et alJl2005bH Li et all 



2003; 



2006; 



Swanson et all 120081 : lLoh et alll2010l ). Luminous galax- 



ies generally cluster more strongly than faint galaxies, 
reflecting their tendency to reside in denser environ- 
ments. Galaxies with bulge-dominated morphologies, 
red colors, or spectral types indicating old stellar pop- 
ulations also exhibit stronger clustering and a prefer- 
ence for dense environments. Significant progress has 
also been made in recent years in measuring galaxy 
clustering at intermediate and high redshifts (e.g 



Brown et all 120031: iDaddi et all l2003t lAdelberger et al 
20051: lOuchi et all 120051: iLee et all 120061: iPhleps et al 



20061 iGoiT et al. 2006b, 2008; M eneux et al.ll2008L 12009b 



Abbas et al.l l2010). 



Cosmological inferences from galaxy clustering mea- 
surements are complicated by the existence of galaxy 
bias, the difference between the distribution of galaxies 
and that of the underlying dark matter. While the 
gravitational clustering of dark matter from specified 
initial conditions can be computed reliably with cos- 
mological iV-body simulations, the detailed physics 
of galaxy formation - - gas cooling, star formation, 
and the feedback effects of star formation and black 
hole accretion — is only partly understood, so galaxy 
bias cannot be predicted robustly from first principles. 
Cosmological parameter studies must adopt a math- 
ematical description of galaxy bias and marginalize 
over its uncertain parameters. This procedure is most 
straightforward at large scales, where the effects of bias 
are expected to be simple, i.e., a sca le- indep endent 



amplification of the matter £(r) dKaise: 



Bardeen et al.lll986l: IColeslll993l: iFrvfc Gaztanagal 



1984; 



1993; 



119981: iNaravanan. Berlind. fc Weinberg! I2000D . Con- 
versely, for a specified cosmological model, one can 
constrain detailed descriptions of galaxy bias and thus 
gain insights into galaxy formation physics. 

In the cold dark m atter scenario ([Peebles! 119821 : 
iBlumenthal et al.l 11984) . which is now s upported by a 
wide r ange of observatio nal evidence (e.g-. lDunklev et all 
120091 : iReid et all |2010[ ). galaxies form and reside in 
extended dark matter halos. The existence of such 
halos is well establis hed by studies of spiral galaxy 
rotation curves (e.g., Rubin, Thonnard. fc Ford 119781 : 
iPersic. Salucci. fc Stell Il996fc IVerheiienl I2001D and the 
stellar dynamics (e.g., Gerhard et all 1200 ll) and grav- 
itational lensing (e.g., iBolton et all l2008f) of ellipti- 
cal galaxies. Studies of weak lensing and satellite 
galaxies show that the halos of luminous galaxies ex- 
tend to hundreds of kpc, where they join smoothly 
onto the larger scale d i stribution of dark matter (e.g., 
IZaritskv fc White! [1 994; Fis cher et al.l 120001 : iPrada et all 
120031: Mandel baum et all 120061 ) . The formation of dark 
matter halos is dominated by gravity and can be well 
predicted for a given cosmology from high-resolution nu- 
merical simulations and analytic models. Dark matter 
halos thus become the natural bridge for connecting the 
galaxy distribution and the matter distribution. 

In recent years, the theoretical understanding of galaxy 
clustering has been enhanced through development of 
the Halo Occupation Di stribution (HOD) frame- 
work (e.g., Uing. Mo. fc Bornerl H998J: iMa fc Frvl 120001: 
Peacock & Smith 2000; Scliak 2000; Scoccimarro et al. 
20011: iBerlind fc Weinberg] 120021: ICoorav fc Shethr2002 



Yang. Mo. fc van den B osch 2003; K ravtsov et all 12004 : 
Zheng et all 12005! ) . The HOD formalism describes the 



"bias" relation between galaxies and mass at the level of 
individual dark matter halos, in terms of the probability 
distribution that a halo of virial mass Mh contains N 
galaxies of a given type, together with prescriptions for 
the relative spatial and velocity bias of galaxies and 
dark matter within virialized halos. The combination 
of a cosmological model and a fully specified HOD can 
predict any galaxy clustering statistic on any scale, 
allowing integrated constraints from many observations. 
For an assumed cosmological model and a parameterized 
form of the HOD moti yated by contemporary theories of 
galaxy formation (e.g., Kauffmann, Nusser. & Stcimnetz 



19971: iKauffman et al.l 119991: Benson et al.l 1200 



Berlind et al.l 120031: Kravtsov et al.l 12004 IZheng et all 



20051 : IConrov. Wechsler fc Kravtsov! 120061 ). measure- 



ments of w p (r p ) are already highly constraining, and 
HOD modeling transforms data on galaxy pair counts 
into a physical relation between galaxies and dark mat- 
ter halos. HOD modeling has been applied to interpret 
clustering dat a from a number of surveys at low and high 
redshifts (e.g.,[j ing & Borncr 1998; Jing. Borncr. fc Sutol 
2002t [ Bullock et al. 2002; Moustakas fc Somervilldl2002l: 

Magliocchctti & Porciani 
120031: iZhend 12004 



van den Bosch ct al 

20031: lYan. Madgwick. fc Whit 



2003a; 



Yang et al.l 12005b!: [Zehayi et all l2005bt 



20061 lHamana et al.l 120061: ILee et al.l l2006l 

" IWhite et all 
IBrown et all 



Zheng, Coil fc Zehav i 
Blake. Collister fc Lahavl 



2007 ; 
IgOOgl : 



Quadri ct al. 2008; Wake ct al. 2008; Kim ct al. 




Zheng et all 120091: iRoss. Percival fc Brun ncr 2010). 



Mann. Peacock, fc Heavens! 119981 : iScherrer fc Weinberg! In principle, a complete model of galaxy bias might 
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need to allow for the possibility that the average galaxy 
content of halos depends on large-scale environment as 
well as halo mass, since halo concentrations and as- 
sembly histories show s o me environmental corre l ations 
(ISheth fe Toimenl J200l iGao. Springe! fc White! J2QQJ 
Harker et al. 2006; Wcchslcr et al. 2006; Zhu ct al. 2006 



Croton. Gao fe White! 120071: iJing. Suto. fe Mol 1200'; 
Wetzel et al.r T2007; D alai et all 120081; IZu et"aTl 1200 



However, studies assuming an environment-independent 
HOD have proven successful at e xplainin g galaxy clus- 
terin g in different density r egimes (lAbbas fe Sheth|[2~006l 
20071 ITinker et al J 12008 ; Skib ba et all 120091 : see also 
Blanton et al. 2006; Blanton & Berlind 2007), and the- 



oretical models predict only a small impact of such 
"halo assembly bias" on galaxy clustering statistics 
for mass- or luminosity-thresholded samples ([Yoo et al.l 
[200l ICroton. Gao fc Whitdl2007l: IZu et al.ll2008l) ~ 

The present paper builds upon our investigation 
of galaxy correlat ions in early SDSS redshift data 
(|Zehavi et al.1 [2002) , our use of HOD modeling to in- 
terpret deviations from a power-law in the ga laxy two- 
point correlation function (|Zehavi et al.l 120041 ) . and, es- 
pecially, our earlier investigation of luminosity and color 
dependence of the galaxy correlation function in a sam- 
ple of about 200,000 SDSS galaxies (Zehavi et al. 2005b, 
hereafter Z05). Here we take advantage of the fi- 
nal SDSS galaxy sample — roughly three times more 
galaxies once appropriate cuts are applied — and ad- 
vances in HOD modeling methods to obtain higher pre- 
cision measurements and tighter, more informative con- 
straints on galaxy-halo relations. This study comple- 
ments correlation function measurements and HOD mod- 
els of the SDSS Luminous Red Galaxy (LRG) sam- 
ple (lEisenstein et al.l 120011 l2005bl; [ Zehavi et all p 005a; 
IZheng et al.l2009HKazin et al.ll2010l;IWatson et al.ll2010D . 



which pro bes the most luminous galaxies out to redshift 
z ~ 0.45 (Eis enstein et al.ll2001l ). We focus our anal- 
ysis on volume-limited samples of well defined galaxy 
classes, which allows us to construct HOD models with 
a small number of free parameters to interpret the mea- 
surements for each class. This approach complements 
other analyses of the SDSS main galaxy sample that mea- 
sure l uminosity or stellar mass - weighted correlation func- 
tions (|Li fc Whitell2009L |2010T ) or use marked correlation 
functions to quantify luminosity and colo r dependence 
(|Skibba et al.ll200l ISkibba fc Shethl 120091) . These anal- 
yses typically yield smaller error bars because they use 
more sample galaxies for the measurement, but they re- 
quire a more complete global description of the galaxy 
population to model the results. There are many par- 
allels between our program and the one pursued by 
van den Bosch, Mo, Yang, and their collaborators (e.g., 
papers cited above and Ivan den Bosch. Mo. fc Yangl 
2003bt; Wcinm aruTet al.ll2006Hvan den Bosch et al.ll2007t 



Yang. Mo. fc v an den Bosch 2008; Mo re et al.l [2009h . 



though t hey have largely focused on analysis of group 
catalogs (jYang et al.1 l2005aL 120071 ) rather than detailed 
fitting of the correlation function. The two approaches 
yield qualitatively similar r esults (e.g., Z05; lYang et al. 



20051* IZheng et all 120071: lYang. Mo. fc van den B osch 
20081) 



Our correlation function measurements provide ba- 
sic empirical characterizations of large-scale structure 
at low redshift (z < 0.25), and the luminosity and 



color dependence of these correlation functions can test 
predi ctions from hydrodynamic cosmological si mulations 
(e.g., iPearce et al.l 12001; IWeinberg et al. 20041) or semi- 



analytic models (e.g., jKang et al.l 120051 ; ICroton et al.l 
120061 iBower et alIl2006D . The derived HOD constraints 
provide informative tests of galaxy formation models, 
a low redshift baseline for evolutio nary studies (e.g., 
IZheng et al]|2007t iBrown et al.ll2008D . and a description 
that can be used to create realistic mock catalogs from 
simulations (IScoccimarro fc Shethl 120021: 1 Wechslerl 12004 
lEisenstein et al.l l2005bt ISkibba fc Shethl [2009J McBride 
et al., in preparat i on). As discussed extensively by 
IZheng fc Weinberg! (|2007l ). the HOD formalism can also 
be used in cosmological parameter determinations, al- 
lowing marginalization over the parameters of a bias 
prescription that applies to a wide range of cluster- 
ing statistics from the linear to the highly non-linear 
regime. Combinations of spatial clustering statistics 
and dynamically sensitive measures (such as galaxy- 
galaxy lensing, redshift-space distortions, or group and 
cluster mass-to-light ratios) can break the main de- 
generacies between cosmological parameters and galaxy 
bias. A number of papers have implemented variants 
of this approach to constrain the matter density pa- 
ra meter £l m and the a mplitud e of matter clustering 
as (Ivan den Bosch. Mo. fc Yang 2003b; Abazaiia n et all 
20051: ITinker et al.l 120051: Ivan den Bosch et al.l 120071: 



Cacciato et all 120091; iRozo et al.M2010D . These analy- 



ses argued for a significant downward revision of the 
WMAP1 values of J7 m and/or eg, anticipating the pa- 
rameter changes that occurred with WMAP3 (see also 
I Vale fc Ostrikerl|2006l who reached a similar conclusion 
by a related method) . Our correlation function measure- 
ments are providing essential constraints for such analy- 
ses using the SDSS DR7 data set. 

The paper is organized as follows. Section [2] describes 
the SDSS data and the methods we use to measure galaxy 
clustering and to interpret it via HOD modeling. In JJ]we 
present results on the luminosity dependence of w p (r p ) 
and its implications for HOD models. In §[4] we examine 
the dependence of clustering on galaxy color, including 
cross-correlations between red and blue galaxy samples, 
and we investigate the luminosity dependence for red and 
blue galaxies separately. Section [5] summarizes our re- 
sults. Appendix A discusses some technical issues relat- 
ing to predictions of the galaxy cross-correlation func- 
tion. Appendix B illustrates the robustness of our mea- 
surements to different systematics. Appendix C presents 
in tabular form the w p (r p ) measurements for most of the 
samples discussed in the paper. 

2. OBSERVATIONS AND METHODS 

2.1. Data 



The Sloan D i gital Sky Survey (|York et al.l l2000l: 
IStoughton et al.l l2002t ) was an ambitious project to 
map most of the high-latitude sky in the northern 
Galactic cap, usin g a dedicated 2.5 meter telescope 
(jGunn et all [2006) . The survey started regular oper- 
ations in April 2000 and completed observations (for 
SDSS-II ) in July 2008. A drift-scanning mosaic CCD 
camera ()Gunn et all 1 19981) imaged the sky in five pho- 
tome tric bandpasses ()Fukugita et all 119961 : iSmith et all 



2002) to a limiting magnitude of r ~ 22.5. The imaging 
data were processed through a series of pipelines that 
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perform astrometric calibration ([Pier et all [20 031 . pho- 
tometric reduction (iLuoton et al.lll999t 120011) and pho- 
tometric calib ration (H ogg et al. 200lUlyezic et alj|2004t 
iTucker et all 120061: iPadmanabhan et al.ll2008l) . Objects 
were selected for spectroscopic followup using specific 
algor ithms for the main ga laxy sample (IStrauss et al.1 
I2002D . lu minous red galaxies (lEisenstein et al.ll2001l) . and 
quasars ([Richards et al.l |2002[ ). Targets were assigned 
to spe ctroscopic plates usin g an adaptive tiling algo- 
rithm ([Blanton et a l. 2003a) and observed with a pair 
of fiber-fed spectrographs. Spectroscopic data reduction 
and redshift determination were performed by automated 
pipelines. Galaxy redshifts were measured with a success 
rate greater than 99% and typical accuracy of 30 km s -1 . 
To a good approximation, the main galaxy sample con- 
sists of all galaxies with Petrosian magnitude r < 17.77, 
with a median redshift of ~ 0.1. The LRG redshift 
sample uses color-magnitude cuts to select galaxies with 
r < 19.5 that are likely to be luminous early-type galax- 
ies, extending up to redshift ~ 0.5. 

Galaxy samples suitable for large-scale structure stud- 
ies have be en carefully construct ed from the SDSS red- 
shift data (iBlanton et al.l l2005bD. All magnitudes are 
K-corrected (|Blanton et al.ll2003bl) and evolved to rest- 
frame magnitudes at z — 0.1 (which is near the median 
redshift of the sample and thus minimizes corrections) 
using an upd ated version of the evol ving luminosity func- 
tion model of iBlanton et al.l ([2003d ) 15 . The radial selec- 
tion function is derived from the sample selection crite- 
ria. When creating volume-limited samples below, we 
include a galaxy if its evolved, redshifted spectral energy 
distribution places it within the main galaxy sample's 
apparent magnitude and surface brightness limits at the 
limiting redshift of the sample. The angular complete- 
ness is characterized carefully for each sector (a unique 
region of overlapping spectroscopic plates) on the sky. 

Due to the placement of fibers to obtain spectra, no two 
targets on the same plate can be closer than 55". This 
results in ~ 7% of targeted galaxies not having a mea- 
sured redshift. We assign these galaxies the redshift of 
their nearest neighboring galaxy; roughly speaking, this 
method double-weights the galaxy that was observed, 
but it retains the additional information present in the 
angular position of the "collided" galaxy. As shown in 
Z05, this treatment works remarkably well for projected 
statistics such as w p (r p ), above the physical scale corre- 
sponding to 55" (r p sa 0.13 ft. -1 Mpc at the outer edge 
of our sample) . We thus limit the measurements in this 
paper to scales larger than that. The median deviation 
of w p (r p ) for the range of separations we utilize is 0.2%, 
much less than the statistical errors on the measurements 
(Fig. 3 in Z05). It is, in fact, possible to correct for fiber 
collisions down to scales as small as 0.01 h~ l Mpc us- 
ing the ratio of small-ang le pairs in the spectroscopic 
and photometric catalogs (jMasjedi et al.1 120061 : iLi et al.1 
2006; Li & White 2009), but we have not implemented 
this technique here. 

The clusterin g measurements in th is paper are based 
on SDSS DR7 (JAbazaiian et al.ll2009ft . which marks the 
completion of the original goals of the SDSS and the end 
of the phase known as SDSS-II. The associated NYU 
Value-Added Galaxy Catalog (NYU-VAGC) 15 includes 

15 http://sdss.physics.nyu.edu/vagc/lss.html 



approximately 700, 000 main sample galaxies over about 
8000 deg 2 on the sky. This data set can be compared to 
the much smaller sky coverage of the samples in pre- 
vious correlatio n function a n alyses of the SDSS main 
galaxy sample: IZehavi et al.l (|2002l ) used an early sam- 
ple of ~ 700 deg 2 , and Z05 analyzed a sample of about 
2500 deg 2 . The contiguous northern footprint of DR7 of- 
fers further advantage over earlier data sets by reducing 
boundary effects. Figures [T][3] show the distribution of 
the main sample galaxies in right ascension and redshift 
for slices near the celestial equator. These plots nicely 
illustrate the large-scale structure we aim to study using 
the two-point correlation function, as well as the poten- 
tial dependencies on galaxy properties. Diagrams that 
show contigui ty of st r uctur e over multiple SDSS slices 
appear in IChoi et al.1 ((2010), who analyze the topology 
of large scale structure in the DR7 main galaxy sample. 

Throughout this paper, we refer to distances in co- 
moving units, and for all distance calculations and abso- 
lute magnitude definitions we adopt a flat ACDM model 
with 51 m = 0.3. We quote distances in h^ 1 Mpc (where 
h = _ffo/100 km s _1 Mpc -1 ), and we quote absolute 
magnitudes for h = 1. Our correlation function mea- 
surements are strictly independent of Hq, except that 
the absolute magnitudes we list as M r are really values 
of M r + 5 log h. Changing the assumed O m or Ha would 
have a small impact on our measurements by changing 
the distance-redshift relation and thus shifting galaxies 
among luminosity bins and galaxy pairs among radial 
separation bins. However, even at our outer redshift limit 
of z = 0.25, the effect of lowering J7 m from 0.3 to 0.25 is 
only 1% in distance, so our measurements are effectively 
independent of cosmological parameters within their ob- 
servational uncertainties. 

In order to work with well-defined classes of galaxies, 
we study volume-limited samples constructed for vary- 
ing luminosity bins and luminosity thresholds. While 
volume-limited subsamples include fewer galaxies than 
the full flux-limited sample, they are much easier to inter- 
pret. For a given luminosity bin, we discard the galaxies 
that are too faint to be included at the far redshift limit 
or too bright to be included at the near limit. We include 
galaxies with 14.5 < r < 17.6, with the conservative 
bright limit imposed to avoid small incompletenesses as- 
sociated with galaxy deblending (the NYU-VAGC safe 
samples). We further cut these samples by color, us- 
ing the K-corrected g — r color as a separator into dif- 
ferent populations. We also study a set of luminosity- 
threshold samples, namely volume-limited samples of all 
galaxies brighter than a given threshold, as these yield 
higher precision measurements than luminosity-bin sam- 
ples and are somewhat more straightforward for HOD 
modeling. For these samples we relax the bright flux 
limit to r > 10.0, in order to be able to define a viable 
volume-limited redshift range (the NYU-VAGC bright 
samples). The distribution in magnitude and redshift 
and the cuts used to define the samples are shown in 
Figure SJ Details of the samples are given in Tables Q] 
and[2l For luminosity-threshold samples, one could im- 
prove statistics by using the flux-limited galaxy cata- 
log and weighting galaxy pairs by the inv erse volume 
over which t hey can be observed, as done bv lLi fc White 
(2009, 2010) for samples weighted by stellar mass and lu- 




Fig. 1. — The distribution of galaxies in the SDSS main galaxy sample. Only galaxies within ±1.25 degrees of the Celestial Equator are 
shown. 



minosity. This procedure would extend the outer redshift 
limit for the more luminous galaxies above the threshold, 
thus reducing sample variance, but it has the arguable 
disadvantage of using different measurement volumes for 
different subsets of galaxies within the sample, and we 
have not implemented it here. 

The full spectroscopic survey of the SDSS DR7 Legacy 
survey contains 900,000 unique, survey-quality galaxy 
spectra over 8000 deg 2 . Of these objects, the main galaxy 
sample target criteria selected 700,000. SDSS targeted 
the remainder as luminous red galaxy candidates (around 
100,000) or in other categories (e.g., as quasar candi- 
dates or in special programs on the Equator). We use 
a reduced footprint of 7700 deg 2 , which excludes areas 
of su spect photometric calibration (|Padmanabhan et al.l 
2008) and incomplete regions near bright stars. This re- 
duction leaves 670,000 main sample galaxies. Because we 
are using an updated photometric reduction, a substan- 
tial fraction of targets are assigned fluxes fainter than 



the original flux limit, which further reduces the sam- 
ple to about 640,000 galaxies. For uniformity we have 
imposed an even stricter faint limit of r = 17.6 in this 
paper, which yields 540,000 galaxies. About 30,000 of 
the original targets at that flux limit were not assigned 
fibers because of fiber collisions; we assign these objects 
the redshift of their nearest neighbor as discussed above. 
The resulting sample of 570,000 galaxies constitutes the 
parent sample for all of the volume-limited samples in 
this paper. When we apply a bright magnitude cut of 
r = 14.5, it eliminates about 6,000 galaxies. Further de- 
tails and the samples themselves are available as part of 
the public NYU-VAGC datasets. 

2.2. Clustering Measures 

The auto-correlation function is a powerful way to 
characterize galaxy clustering, measuring the excess 
probability over random of finding pairs o f galaxies as a 
function of separation (e.g.. iPee blcs 1980|). To separate 
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Fig. 2. — A slice through the SDSS main galaxy sample, with galaxies color-coded based on rest-frame g — r color. The slice shows 
galaxies within ±4 degrees of the Celestial Equator, in the north Galactic cap. The redshift limit is smaller than in Figure [T] to better 
reveal details of structure. The large structure cutting across the center of the map is the "Sloan Great Wall" (Gott et al.;!2005') discussed 



effects of redshift distortions from spatial correlations, it 
is customary to estimate the galaxy correlation function 
on a two-dimensional grid of pair separations parallel (jr) 
and perpe ndicular (r v ) to the line of sight. Following the 
notation of Fi sher et al.l ((1994J ) , for a pair of galaxies with 
redshift positions vi and V2, we define the redshift sep- 
aration vector s = vi — V2 and the line-of-sight vector 
1 = |(vi + V2). The parallel and perpendicular separa- 
tions are then 

7r=|s-l|/|l|, r p 2 = s-s-7T 2 . (1) 

To estimate the pair counts expected for unclustered ob- 
jects while accounting for the complex survey geometry, 
we generate volume-limited random catalogs with the de- 
tailed angular selection function of the samples. For the 
different galaxy samples, we use random catalogs with 
25-300 times as many galaxies, depending on the vary- 
ing number density and size of the samples. We have 
verified that increasing the number of random galaxies 
or replacing the random catalog with another one makes 
a negligible differ ence to the measu r emen ts. We estimate 
£(r p ,7r) using the lLandv fc Szalavl (J1993D estimator 

, DD-2DR + RR 

&„,*) = m , (2) 

where DD, DR and RR are the suitably normalized num- 
bers of weighted data-data, data-random and random- 
random pairs in each separation bin. We weight the 
galaxies (real and random) according to the angular se- 
lection function; because we are using volume-limited 



samples, we do not weight by a radial selectio n function. 
We al so te sted the alternative £ e stimators of lHamiltonl 
(|1993[ ) and lDavis fc Peebles! (|1983f ) and found only small 
differences in the measurements. See Appendix [B] for 
these and other tests of our standard analysis procedures. 
To examine the real-space correlation function, we fol- 
low standard practice and compute the projected corre- 
lation function 



w p (r p ) = 2 / dn£,{r pi n). 
Jo 



(3) 



In practice, for most samples we integrate up to 7r max = 
60 h~ l Mpc, which is large enough to include most corre- 
lated pairs and gives a stable result by suppressing noise 
from distant, uncorrelated pairs. For samples with low 
outer redshift limits we use 7r max = 40/i _1 Mpc (see Ta- 
blesQ]and[2]). We use these 7r max values consistently when 
performing HOD modeling of the clustering results (not 
including the small residual effects of redshift-space dis- 
tortions). We use linearly spaced bins in -k with widths 
of 2/i _1 Mpc. Our bins in separation r p are logarith- 
mically spaced with widths of 0.2 dex. We checked the 
robustness to binning in r p and it and found our results 
to be insensitive to either. The measurements are quoted 
at the pair-weighted average separation in the bin. We 
estimate that this separation varies by at most 1% from 
the r p for which w p (r p ) equals the pair- weighted aver- 
age of w p in the bin. This corresponds to a change of the 
same magnitude in w p , significantly smaller than the sta- 
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Fig. 3. — Like Figure [2] but with galaxies color-coded by absolute magnitude. The size of the dots is also proportional to galaxy 
luminosity. As expected for a flux-limited survey, more luminous galaxies dominate at larger redshifts. 
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Fig. 4. — The distribution in redshift and r-band absolute magnitude for the SDSS sample with the imposed flux limits. We plot a 
random subset of the SDSS galaxies, sparsely sampled by a factor of 10. The lines show the magnitude and redshift ranges of the different 
volume-limited samples used in this paper. Luminosity-bin samples are shown on the left and luminosity-threshold samples on the right. 
All luminosity-threshold samples have z m j n = 0.02, so a sample consists of the set o f gal axies above the horizontal line marking the M r 
threshold and left of the corresponding vertical line marking z ma x- As discussed in i|2.1l we K-correct all galaxy magnitudes to redshift 
z = 0.1, and we quote absolute magnitudes for h = 1. 
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tistical errors on the measurements, and an up to 0.5% 
shift in the best-fit correlation length. 

The projected correlation function can be related to 
the real-space correlation function, £(r), by 



w p (r p ) 



rdr£(r)(r 2 -r p 2 )- 1/2 



(4) 



(|Davis fc Peebles! [19 83) . In particular, for a power- law 
£( r ) = ( r / r o)~ 7 , one obtains 



w p (r p ) 



rljlr 



^'r(I), (5, 

allowing one to infer the best- fit power- law for £(r) from 
w p . Alternatively, one can invert w p to get £(r) inde- 
pendent of the power-law assumption. Here, however, 
we focus on w p itself, as this is the statistic measured 
directly from the data that is determined by the real- 
space correlation function. We note that equation ([5]) 
strictly holds only in the limit of integrating to infinity 
to obtain w p . For most of our measurements used in the 
fits, however, r p < 7r max /4, and this has a minimal ef- 
fect. In this paper, we focus on HOD modeling of the 
measurements (using the finite 7r max values consistently) 
and pro vide power- law fits only as qualitative guidelines, 
but see ICoil et all (2008) for a possible way to modify 
the power-law fitting. 

We estimate statistical errors on our different measure- 
ments using jackknife resampling, as in Z05. We define 
144 spatially contiguous subsamples of the full data set, 
each covering approximately 55 deg 2 on the sky. Our 
jackknife samples are then created by omitting each of 
these subsamples in turn. The error covariance matrix is 
estimated from the total dispersion among the jackknife 
samples, 

N-l N 
Covarfe, &) = — — J2& ~ m/ ~ &)> (6) 



8=1 



where N = 144 in our case, and & is the mean value of 
the statistic £ measured in radial bin i in all the samples 
(£ denotes here the statistic at hand, whether it is £ or 
w p ). In Z05 we used N = 104 for the smaller sample. The 
larger value here is chosen to enable better estimation 
of the full covariance matrix, while still allowing each 
e xcluded subvolume to be sufficiently large. 

iNorberg et al.l (|2009T ) have recently studied a variety of 
error estimators for dark matter correlation functions in 
N-body simulations, comparing internal methods such 
as jackknife and bootstrap to external estimates de- 
rived from multiple independent catalogs, each compa- 
rable in size to our L* galaxy samples. They find good 
agreement between jackknife and external estimates for 
the variance in w p (r p ) on large scales, with jackknife 
errors somewhat overestimating the externally derived 
errors on small scales (r p < 2 ft, -1 Mpc). Our own 
tests of the jackknife method on PTHalos mock catalogs 
(jScoccimarro fc Shethll2003 ). described by Zehavi et al. 
(2002, 2004) and Z05, show that it yields error and co- 
variance estimates similar to those derived from multiple 
independent catalogs (see Z05, Figure 2). In principle, 
covariance matrices derived from large numbers of realis- 
tic mock catalogs are preferable because they use larger 
total volumes and include cosmic variance on scales of the 



full survey (while jackknife or bootstrap estimates only 
include cosmic variance on the scale of individual sub- 
samples). However, the tests mentioned above suggest 
that jackknife estimates are sufficient for our purposes, 
and they are a far more practical tool when working with 
many subsamples of different clustering properties, as the 
mock catalog approach would require a new set of realiza- 
tions mimicking the clustering signal of each one. The 
INorberg et al.1 (|2009f ) tests suggest that parameter un- 
certainties derived using the jackknife errors should, if 
anything, be conservative. Because of potential noise or 
systematics in jackknife estimates of the full covariance 
matrix, we also present some model fits below that use 
only diagonal elements. 

2.3. HOD Modeling 

We interpret the clustering measurements in the frame- 
work of the halo occupation distribution (HOD), which 
describes the bias between galaxies and mass in terms 
of the probability distribution P(N\Mh) that a halo of 
virial mass Mh contains N galaxies of a given type. Our 
modeling effectively translates galaxy clustering mea- 
surements for each class of galaxies into halo occupation 
functions (N(Mh)}, the mean number of galaxies as a 
function of halo mass. Other aspects of the HOD — the 
form of P(N\(N}) and the galaxy distribution within ha- 
los — are specified by theoretical expectations. 

We adopt a spatially flat "concordance" ACDM cos- 
mological model with matter density parameter Q, m = 
0.25, and baryon density parameter Sit, = 0.045, con- 
sistent with recent determinations from the cosmic mi- 
crowave background (WMAP5; iHinshaw et al.l 120091: 
iDunklev et~al? 2009; K omatsu et aLl|2009l) supernova la 
(|Kowalski et all 120081; iKessler et al.|l2OO90 . and baryon 
acoustic oscillations (|Percival et alj 12010). Accordingly, 
we assume a primordial density power spectrum with 
fluctuations at 8 hr 1 Mpc scale of cr§ = 0.8. The Hub- 
ble constant we use is Hq — 70 km s _1 Mpc -1 , and we 
assume an inflationary spectral index n s = 0.95. Lower- 
ing £2 m to 0.25 has only a small effect on our clustering 
measurements (see Appendix |BJ) . 

Hydrodynamic simulations show that the most mas- 
sive galaxy in a halo typically resides at or near t he halo 
center (e.g.. lBerlind et a l. 2003; Simh a et al.l2009D . in ac- 
cord with the expectations of semi-analytic models (e.g., 
White fc Frenll[l99l _ JKauffmann. White, fc Guiderdonil 
1993t iColeet all 11994ft . For HOD parametrization, it 



is useful to separate the contributions of these central 
galaxies from those of the additional, satellite galaxie s 
in each halo (jKravtsov et al.l 12004 IZheng et aLll2005[ ). 
For samples of galaxies brighter than a given luminosity, 
the mean occupation function can be well characterized 
by a smoothed step function for the central galaxy and a 
power-law number of satellites increasing with halo mass. 
We model it in this work using the following form: 

(N(M h )) = (7) 

1 

2 

where erf is the error function erf (a:) = — — L e — * dt 
. The mean occupation function of the central galaxies 
(the left square brackets times the 1/2 factor) is a step- 
like function with a cutoff profile softened to account 



1 + erf 



logM fc -logAf n - 

Clog M 



M h -Mg 
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for the s catter between ga laxy luminosity and halo mass 
(see also lMore et al.ll2009t ). The mean occupation of the 
satellite galaxies (the second term in the right square 
brackets multiplied by the left square brackets times the 
1/2 factor) is a power-law modified by a similar cutoff 
profile. The five free parameters are the mass scale M m ; n 
and width cri og m of the central galaxy mean occupation, 
and the cutoff mass scale Mo, normalization M[, and 
high mass slope a of the satellite galaxy mean occupation 
function. 

This speci fic form is mo t ivated by the theoretical study 
presented in IZheng et al.1 (J2005I ) , and is iden t ical t o the 
five- parameter model a dopted in lZheng et al.l (|2007j ) (see 
also lZheng et al.ll2009l Appendix B). It is more flexible 
than the three-parameter model used in Z05, which has 
the same basic shape. The five-parameter model intro- 
duces two additional parameters to characterize the cut- 
off profiles of central and satellite galaxies, allowing ex- 
cellent descriptions of the (N(Mi l )) functions predicted 
by hydrodynamic simulations and semi-analytic models 
(jZheng et al.ll2005t ). 

Two characteristic halo masses come into play in the 
modeling, which set the mass scales of halos that host 
central galaxies and satellites. M m - m characterizes the 
minimum mass of a halo hosting a central galaxy above 
the luminosity threshold. The exact definition of M m j n 
can vary between different HOD parameterizations; in 
the form we adopt (eq. [7J), it is the mass for which half 
of such halos host galaxies above the luminosity thresh- 
old, i.e., (iV cen (M m i n )) = 0.5. It can also be interpreted 
as the mass of halos in which the median luminosity of 
central galaxies is equal to the luminosity threshold (see 
I Zheng et al.1 120071 for details, except that it was incor- 
rectly labeled as mean rather than median luminosity 
there). The second characteristic mass scale is Mi, the 
mass of halos that on average have one additional satel- 
lite galaxy above the luminosity threshold, defined by 
(iV sat (Mi)) = 1. Note that M x is different from M[ in 
equation ([7]), though it is obviously related to the values 
of M[ and M . 

As is common practice, the distributions of the oc- 
cupation number of central galaxies and satellite galax- 
ies are assumed to follow the nearest-integer and Pois- 
son distributio ns, respectively, consistent with theoret i- 
cal predictions dKravtsov et al-lfeOO^IZheng et al.ll2005t ). 
iBovlan-Kolchin et al.l (|2010f ) have recently argued that 
the distributions of subhalo counts in high mass halos be- 
come super- Poisson at high {N ssA ), but we expect such 
a distribution to have minimal quantitative impact on 
our clustering predictions. The spatial distribution of 
satellite galaxies within halos is assumed to be the same 
as that of the dark matter, w hich follows to a good ap- 
proxi mation an NFW profile (jNavarro. Frenk. fc White] 
1996). For the halo concent ration parame t er c(M h), we 
adopt the relation given by iBullock et al.l (|2001| ). mod- 
ified to be consistent with our halo definition that the 
mean density of halos is 200 times the background den- 
sity: c(M h ) = c Q (M h /M al )P(l + z)~\ where c = 11, 
P = -0.13, and M nl = 2.26 x 10 12 h,- 1 M Q is the nonlin- 
ear mass scale at z = 0. 

The two-point correlation function of galaxies in 
ou r model is c alculated using the method described 
by iTinker et al.l ()2005fl . specifically their "n^-matched" 



method, which improves the algorithm in I Zheng! (2004) 
by incorporating a more accurate treatment of the halo 
exclusion effect. The method, calibrated and tested us- 
ing mock catalogs, is accurate to 10% or better. We use 
the measured values of w p (r p ) with the full error covari- 
ance matrices. We also incorporate the observed number 
density of galaxies in each subsample as an additional 
constraint on the HOD model, with an assumed 5% un- 
certainty. That is, we form the x 2 as 

x 2 = (w p - wp^c-Vp - wpHK-n;) 2 /^, (8) 

where w p and n g are the vectors of the two-point cor- 
relation function and the number density of the sample, 
and C is the full covariance matrix. The measured values 
are denoted with a superscript *. 

We implement a Markov Chain Monte Carlo (MCMC) 
code to explore the HOD parameter space. At each point 
of the chain, a random walk is taken in the parameter 
space to generate a new set of HOD parameters. The 
step-size of the random walk for each parameter is drawn 
from a Gaussian distribution. The probability to accept 
the new set of HOD parameters is taken to be 1 if Xnew — 

Xold and ex P["(Xnow - Xold)] if Xnew > Xold' where Xold 

and Xnew are the values of % 2 for the old and new models. 
Flat priors in logarithmic space are adopted for the three 
parameters related to mass scales and flat priors in linear 
space are used for the other two HOD parameters. The 
length of the chain for each galaxy sample is typically 
10,000 and we find convergence by comparing multiple 
realizations of chains. 

Before turning to our observational results, it is worth 
noting the similarities and differences between HOD 
modeling and two closely related methods, conditional 
luminosity functions (CLF) and sub-halo abundance 
matching (SHAM). Each well defined class of galax- 
ies, e.g., a luminosity-bin or luminosity-threshold sam- 
ple, has its own HOD. A conditiona l luminosity function 
(jYang. Mo. fc van den Bosch] I2003J ) provides a global 
model of the full galaxy population, specifying the lu- 
minosity function at each halo mass. An HOD can be 
calculated from a CLF by integrating the latter over lu- 
minosity, and a CLF can be calculated from a series of 
HODs by smoothed differentiation. The virtue of the 
CLF is its completeness, but when fitting data it typically 
requires stronger prior assumptions, such as a functional 
form for the luminosity function itself and functional 
forms for the dependence of luminosity function parame- 
ters on halo mass. By contrast, the five-parameter HOD 
model used here is already flexible enough to provide a 
near-perfect description of theoretic al mod el predictions 
for luminosity-thres hold samples ([ Zheng ct al. 2005). 
The SHAM method (IConrov. Wechsler fc Kravtsovl2006l : 
IVale fc Ostrikerll2006f) assumes a monotonic relation be- 
tween the luminosity or stellar mass of a galaxy and the 
mass or circular velocity of its parent halo or subhalo; 
the method can be generalized to allow scatter in this 
relation. While an HOD model takes the space density 
and clustering of a galaxy population as input for pa- 
rameter fits, a SHAM model takes only the space density 
as input and predicts the clustering, effectively using a 
theoretical prior to specify the satellite occupation func- 
tion. SHAM models are remarkably successful at match- 
ing the Z 05 correlation functions of luminosit y-threshold 
samples (|Conrov. Wechsler fc Kravts ov 2006). 
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The correlation functions of sub-L* galaxies at low 
redshift are typically well de scribed by power laws 
on scales r v < 10 h' 1 Mpc (iTotsufi fe Kihara] 119691; 
!Peeblesl[l97l: iGott fe Turnerlll979l) . but deviations from 
a power l aw become progres sively stronger for brighter 
galaxies (Ze havi et al.l 12004 Z05: this paper) and at 
higher redshifts (jConrov. Wechsler fc Kravtsovll2006l and 
references therein). In the cont ext of HOD models, 
iWatson. Berlind. fc Zentneil (|2011f ) discuss the physical 
processes that lead both to an approximate power law 
correlati on function and to deviations from a power law 
(see also lBenson et al.ll2000L iBerlind fc Weinberg! I2002L 
and Appendix A of lZheng et alll2009[ ). 

3. DEPENDENCE ON LUMINOSITY 
3.1. Clustering Results 

We study the clustering dependence on luminosity us- 
ing sets of volume-limited samples constructed from the 
full SDSS sample, corresponding to different luminosity 
bins and thresholds. Details of the individual samples 
are given in Tables [T] and [2] and illustrated in Figure |U 

As a representative case, we show in Figure [5] the two- 
dimensional correlation function, £(r p ,7r), as a function 
of separations perpendicular, r p , and parallel, 7r, to the 
line of sight, calculated for the —20 < M r < — 19 sam- 
ple. In the absence of redshift-space distortions the con- 
tours would have been isotropic, a function only of to- 
tal separation (\Jr p 2 + tt 2 ). Redshift-space distortions 
enter in the line of sight direction and are clearly evi- 
dent in the plot. For small projected separations, the 
contours are elongated along the line of sight direction, 
reflecting the "fingers-of-God" effect of small-scale virial 
motions in collapsed objects. On larger scales, we see 
the compression caused by coherent large-scale stream- 
ing into overdense regions and out of underdense regions 
(|Sargent fc Turnerl [T977tlKaiserl[l987l) . 

We isolate real-space correlations by calculating the 
projected correlation function, w p {r p ), according to 
equation ([3]). Figure [5] shows the projected correlation 
functions obtained for the volume-limited samples de- 
fined by luminosity bins and by luminosity thresholds. 
For the luminosity bins, we find a pronounced depen- 
dence of clustering on luminosity for the bright sam- 
ples, with the more luminous galaxies exhibiting higher 
clustering amplitudes. The dependence on luminosity is 
more subtle for the fainter luminosity-bin samples, with 
little change for scales r p < Ihr 1 Mpc. Behavior for the 
luminosity thresholds is similar, with nearly identical cor- 
relation functions for the M r < —18.5 and < —19.5 sam- 
ples, a slow but significant increase in clustering strength 
moving to M r < —20.5 and M r < —21.0, then a rapid 
increase going to M r < —21.5 and M r < —22.0. Mea- 
surements for the luminosity-threshold samples are less 
noisy, and one can see that the shapes of w p (r p ) are 
similar for all samples at r p > 3ft -1 Mpc, while the 
brighter samples exhibit a stronger inflection in w p {r p ) 
at r p « 2hr x Mpc and a steeper correlation function 
at smaller scales. The error covariance matrices exhibit 
significant correlation between the measurements on dif- 
ferent scales, particularly for the relatively faint, smaller 
volume, galaxy samp les. Similar behavior is found by 
iMcBride et al.l (|2011h . Power- law fits for these cluster- 
ing measurements, using the measured data points for 




Fig. 5. — Contours of the galaxy correlation function as a func- 
tion of tangential separation r v and line-of-sight separation 7r, eval- 
uated for the -20 < M r < -19 sample in 2 ft, -1 Mpc bins. The 
heavy (blue) contour marks £,{r v ,ir) = 1; inner £ > 1 (blue) con- 
tours are spaced by 0.1 in log£ and outer £ < 1 (magenta) contours 
by 0.1 in £. The dotted contours denote values of 0.067 and 0.033, 
while the thick dashed contour marks £(r p , 7r) = 0. In the absence 
of redshift-space distortions, contours would be isotropic. The mea- 
sured contours show the compression at large scales caused by co- 
herent peculiar velocities and the elongation at small r p caused by 
"finger-of-God" distortions in collapsed structures. 



r p < 20 hr 1 Mpc, are presented in Tables [T] and [2] We 
include fits computed both with and without the off- 
diagonal terms in the covariance matrix (i.e., setting the 
off-diagonal terms to zero). We caution that when the 
power-law fit is an inadequate description of the data, 
as indicated by large x 2 /d.o.f. values, the ro and 7 un- 
certainties have limited meaning — fitting over different 
ranges of the data could produce different values of the fit 
parameters. Tests in Appendix iBl on the M r < —21 sam- 
ple show that changes to our analysis procedures cause 
tq and 7 changes that are smaller than (or at most com- 
parable to) the statistical error bars. 

At large scales, we expect the real-space galaxy cor- 
relation function to be a scale-independent multiple of 
the dark matter correlation function £, gg (r) — b 2 ^ mm (r), 
where the bias factor b g will differ from one class of galax- 
ies to another. For each sample, we calculate the best- 
fitting bias factor of the measured w p (r p ) with respect 
to that of the dark matter over the separation range 
4 ft -1 Mpc < r p < 30ft. -1 Mpc, using the full error co- 
variance matrix of the measurements. (The average of r p 
weighted by the inverse of the uncertainty in w p corre- 
sponds to a separation of ~ 8 ft -1 Mpc, which can be re- 
garded as a rough estimate of the effective radius for this 
fit.) The w p (r p ) predicted for the non-linear matter dis- 
tribution of our ACDM cosmological model is computed 
from the non-line ar power spectrum with the method of 
ISmith et ai]|2003l . Filled circles in Figure [7] show these 
bias factors b g (L) and b g {> L) for our luminosity-bin and 
luminosity-threshold samples. We refer to these below as 
"DM-ratio" bias factors. In Z05 we defined bias factors 
via the value of w p {r p ) at one representative separation, 
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TABLE 1 
Volume-limited Correlation Function Samples Corresponding to Luminosity Bins 



M r 




C-^min 


C2max 


^gal 


A"blue 


AW 


TO 


7 


x 2 

dof 


r d 


■yd 


n 


(Mr) 


^max 


-23 to 


-22 


30,900 


73,500 


10,251 


1,797 


8,452 


10.47 ± 0.25 


1.92 ± 0.03 


2.4 


10.40 ± 0.18 


1.94 ± 0.02 


0.004 


-22.22 


60 


-22 to 


-21 


19,900 


47,650 


73,746 


27,496 


46,249 


5.98 ± 0.11 


1.92 ± 0.02 


5.0 


6.30 ± 0.06 


1.88 ± 0.01 


0.111 


-21.32 


60 


-21 to 


-20 


12,600 


31,900 


108,629 


50,879 


57,749 


5.46 ± 0.15 


1.77 ± 0.02 


3.8 


5.80 ± 0.09 


1.75 ± 0.01 


0.530 


-20.42 


60 


-21 to 


-20 


12,600 


19,250* 


17,853 


8,103 


9,749 


4.82 ± 0.23 


1.87 ± 0.03 


2.5 


5.33 ± 0.13 


1.81 ± 0.03 


0.530 


-20.42 


10 


-20 to 


-19 


8,050 


19,250 


44,348 


25,455 


18,892 


4.89 ± 0.26 


1.78 ± 0.02 


3.8 


5.19 ± 0.13 


1.80 ± 0.02 


1.004 


-19.47 


60 


-19 to 


-18 


5,200 


12,500 


18,200 


13,035 


5,165 


4.14 ± 0.30 


1.81 ± 0.03 


2.3 


4.59 ± 0.18 


1.93 ± 0.04 


1.300 


-18.48 


10 


-18 to 


-17 


3,200 


7,850 


5,965 


4,970 


995 


2.09 ± 0.38 


1.99 ± 0.14 


2.0 


4.37 ± 0.37 


1.91 ± 0.08 


1.972 


-17.46 


40 



Note. — All samples use 14.5 < m r < 17.6. tq and 7 are obtained from fitting a power-law to w p (r p ) using the full error covariance matrices, 
while ro d and -y d are obtained when using just the diagonal elements. For all samples, the number of degrees-of-freedom (dof) is 9 (11 measured w p 
values minus the two fitted parameters), n is measured in units of 10~ 2 h 3 Mpc~ 3 . A handful of galaxies do not have well measured colors, so N h i ua 
and N re d do not sum to N ga \. The smaller —21 < M r < —20 sample, indicated with an asterisk, is limited to a smaller redshift range to avoid the 
effects of the Sloan Great Wall (see text). 



TABLE 2 
Volume-limited Correlation Function Samples Corresponding to Luminosity Thresholds 



A/™ ax 


C^max 


ATgal 


A"blue 


AW 


ro 


7 


x 2 

dot 


r d 


l d 


n 


TTmax 


-22.0 


73,500 


11,385 


2,145 


9,237 


10.71 ± 0.24 


1.91 ± 0.03 


3.2 


10.56 ± 0.17 


1.92 ± 0.02 


0.005 


60 


-21.5 


59,600 


39,456 


10,576 


28,876 


7.27 ± 0.14 


2.00 ± 0.01 


8.8 


7.68 ± 0.08 


1.94 ± 0.01 


0.028 


60 


-21.0 


47,650 


83,238 


30,159 


53,075 


5.98 ± 0.12 


1.96 ± 0.02 


6.1 


6.46 ± 0.06 


1.90 ± 0.01 


0.116 


60 


-20.5 


39,700 


132,225 


54,827 


77,395 


5.60 ± 0.12 


1.90 ± 0.01 


3.2 


6.01 ± 0.06 


1.85 ± 0.01 


0.318 


60 


-20.0 


31,900 


141,733 


62,862 


78,868 


5.54 ± 0.14 


1.83 ± 0.01 


3.8 


6.00 ± 0.09 


1.79 ± 0.01 


0.656 


60 


-20.0 


19,250* 


30,245 


12,733 


17,510 


5.24 ± 0.28 


1.87 ± 0.03 


1.2 


5.53 ± 0.13 


1.85 ± 0.02 


0.656 


60 


-19.5 


25,450 


132,664 


62,892 


69,770 


5.11 ± 0.17 


1.81 ± 0.02 


1.8 


5.37 ± 0.08 


1.81 ± 0.01 


1.120 


60 


-19.5 


19,250* 


51,498 


24,005 


27,491 


5.17 ± 0.27 


1.84 ± 0.03 


2.3 


5.36 ± 0.13 


1.85 ± 0.02 


1.120 


60 


-19.0 


19,250 


77,142 


39,554 


37,585 


4.86 ± 0.27 


1.85 ± 0.03 


3.2 


5.23 ± 0.12 


1.85 ± 0.02 


1.676 


60 


-18.5 


15,750 


58,909 


32,554 


26,355 


4.48 ± 0.33 


1.86 ± 0.04 


2.1 


5.33 ± 0.18 


1.83 ± 0.03 


2.311 


10 


-18.0 


12,500 


39,027 


23,159 


15,868 


4.10 ± 0.34 


1.85 ± 0.04 


1.8 


4.75 ± 0.17 


1.91 ± 0.04 


3.030 


40 



Note. — All samples use 10.0 < m r < 17.6. z m i n for the samples is 0.02. ro and 7 are obtained from fitting a power-law to 
w p (r p ) using the full error covariance matrices, while ro d and 7 d are obtained when using just the diagonal elements. For all 
samples, the number of degrees-of-freedom (dof) is 9 (11 measured w p values minus the two fitted parameters), n is measured 
in units of 10 — 2 h 3 Mpc -3 . The samples indicated with an asterisk are limited to a smaller redshift range to avoid the effects 
of the large supercluster (see text). 
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Fig. 6. — Projected correlation functions for volume-limited samples corresponding to different luminosity-bin samples (left) and 
luminosity-threshold samples (right), as labeled. Error covariance matrices are computed from jackknife resampling as described in the 
text. The error bars shown are the square root of the diagonal elements of these matrices. For visual clarity, only a subset of the threshold 
samples are plotted. 
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r p = 2.67 h 1 Mpc. Open triangles in Figure [7] show 
this measurement of "single-r p " bias factors for compar- 
ison. For the -21 < M r < -20 bin and the M r < -20 
and M r < —19.5 thresholds, we use the samples with 
c-Zmax = 19,250 km s^ 1 (see Tables [T] and [2]), for the 
reasons discussed in §3.21 below. Because b g (L) changes 
so rapidly between M r = — 21 and M r = —22, we have 
also divided the — 22 < M r < —21 bin into two half- 
magnitude bins and computed bias factors separately for 
each. The open circles, discussed further in M3.31 show 
large-scale bias factors derived from HOD model fits to 
the full projected correlation functions ( "HOD bias fac- 
tors"). 

In agreement with prev ious studies (|Norberg et al.l 
120011: iTegmark et all 120041: Z05), b g {L) is nearly flat 
for luminosities L < L*, then rises sharply at brighter 
luminosities. 16 Dotted and dashed curves in the left 
panel show the e mpirica l fits to b a (L)/b (L* ) prop osed 
bv iNorberg et al.l (|2001| ) and ITegmark et al.l (|2004| ). re- 
spectively, where we take as b g (L t ) the "DM-ratio" bias 
factor estimated for the —21 < M r < —20 luminosity 
bin us ing the large-scale w p (r p ) ratio. The lNorberg et all 
(2001) form appears to fit our measurements better, but 
the differences between the curves only become large for 
the —18.0 < M r < —19.0 sample, where the single- r p 
and DM-ratio bias factors differ noticeably, and where 
the tests discussed in £ 33.21 below suggest that cosmic 
variance fluctuations are still significant. The HOD bias 
factors are in good agreement with the "DM-ratio" ones. 

The luminosity-threshold samples allow more precise 
bias measurements, and they avoid binning effects that 
can influence the estimates of b{L) when it changes 
rapidly across a bin. The HOD and DM-ratio values 
of b g (> L) agree well for all luminosity-threshold sam- 
ples except M r < —18.0, where the HOD fit overpredicts 
the large-scale w p (r p ) measurements (see Figure [10] be- 
low). The HOD bias points are fit to 3% or better by the 
functional form 

b g (> L) x (ct 8 /0.8) = 1.06 + 0.2KZ/.L*) 1 - 12 , (9) 

where L is the r-band luminosity c orrected to z = 0.1 an d 
L* corresponds to M r = -20.44 (jBlanton et al.ll2003cD . 
Except for the M r < — 18 point, this formula also ac- 
curately describes the DM-ratio bias factors. The HOD 
and DM-ratio bias factors scale as a^ 1 to a near-perfect 
approximation, since at large scales £ gg = b g £ mm ex b 2 g a\. 
We consider equation (J9J to be our most robust esti- 
mate of the dependence of large-scale bias on galaxy lu- 
minosity, applicable over the range 0.16L* < L < 6.3L, 
(-22.5 < M r < -18.5). Fitting the DM-ratio bias values 
for the luminosity-bin samples yields 

b g (L) x (<7 8 /0.8) = 0.97 + 0.17(L/L*) 1M , (10) 

which is close to the formula derived by INorberg et al.l 
(|2001|) for 6j-selected galaxies, but has a slightly steeper 
rise at high luminosities. 

3.2. Tests of Cosmic Variance 

Our volume-limited, luminosity-bin samples span dif- 
ferent ranges in redshift (specified in Table [TJ , with in- 
trinsically brighter galaxies observed over larger volumes. 



16 For the Blanton ct al. (2003c ) luminosity function, the char- 
acteristic luminosity L„ of the Schcchter ( 1976) luminosity function 
fit corresponds to M r = —20.44. 



It is thus important to test for the robustness of the de- 
tected luminosity dependence to "cosmic variance" of the 
structure in these different volumes. (We follow common 
practice in referring to these finite- volume effects as cos- 
mic variance, though a more precise term woul d be "sam- 
ple variance" ;[ScottIlrednicki^&WhEi[ilM) Figured] 
compares projected correlation functions of adjacent lu- 
minosity bins when using their respective full volume- 
limited redshift range (points with error bars) and when 
restricting both to their common overlap range (lines). 
The overlap volume is similar to the full volume of the 
fainter sample (differing only because of the r > 14.5 
bright limit), so the filled points and solid lines are usu- 
ally in close agreement. 

The most significant cosmic variance effect on the mea- 
surements appears to be due to the Sloan Great Wall 
(SGW), a huge supercluster at z ~ 0.08, which is the 
largest coherent structur e detected in the SDSS (Fig. [2] 
see also lGott et al.1120051 ). Its distance places it right at 
the edge of the -21 < M r < -20 sample (see Tabled]). 
Its exclusion from this sample when limiting to the over- 
lap range with the —20 < M r < — 19 sample causes the 
decrease in clustering amplitude on large scales seen in 
the bottom-left panel. Hence, this structure also causes 
the flattening in the projected correlation function of 
this sample at large separations seen in Figure [6] Con- 
versely, restricting the brighter —22 < M r < —21 sample 
to the smaller overlap range accentuates the superclus- 
ter's dominance and gives rise to the increased clustering 
seen in the top- right panel (dashed line). Taken together, 
these results strongly suggest that the most reliable esti- 
mate of w p (r p ) for the — 21 < M r < — 20 luminosity bin 
comes from the SGW-excluded sample rather than the 
full sample. Brighter, larger volume samples are much 
less affected by the SGW, while fainter samples do not 
extend as far and are thus not affected. These results are 
very similar to those of an identical test performed with 
the smaller samples of Z05. 

We have performed similar tests with the luminosity- 
threshold samples, and we find an analogous effect of the 
SGW, mostly for the M r < —20 sample and, to a lesser 
degree, for the M r < —19.5 sample. We have also done 
tests where we have excluded specifically the SGW region 
with angular and redshift cuts, confirming its significant 
impact on the large-scale clustering measurement for 
these samples. It is striking that even with the full SDSS 
sample, the effect of the SGW is still significant, and one 
should use caution in interpreting clustering measure- 
ments for relatively large separations (r p > 5/i _1 Mpc) 
for the few specific samples whose redshift range extends 
just up to (and including) this structure. For this reason, 
in Tables Q] and [2] we also provide power-law fits of these 
samples when restricted to a redshift limit that excludes 
the SGW (cz max = 19,250 km s -1 , the same limiting 
redshift as for the -20 < M r < -19 and the M r < -19 
samples). The HOD parameters derived for these sam- 
ples are relatively insensitive to the choice of sample 
volume, and the correlation functions corresponding to 
these HOD models differ much less than the power-law 
fits (see, e.g., Fig 15 in Z05). This insensitivity reflects 
the constrained nature of HOD fits for a specified cos- 
mological model: there is little freedom within these fits 
to adjust the large-scale correlation amplitude relative 
to the more robust measurements at r p < 2 hr 1 Mpc, so 
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Fig. 7. — Bias factors for the luminosity-bin samples (left) and the luminosity-threshold samples (right). Filled circles show bias 
factors defined by the ratio of the measured w p (r p ) to the dark matter w p (r p ) predicted for our fiducial cosmological model over the 
range 4/i -1 Mpc < r p < 30/i _1 Mpc. Open triangles show the bias factors defined by this ratio for the single radial bin centered at 
r p = 2.67 h Mpc, as done previously by Z05. In addition to the luminosity-bin samples shown in Figure[6] the left panel includes b g (L) 
points for the half-magnit ude bins —21.5 < M r < —21.0 and —22.0 < M r < —21.5. Open circles show the bias factors inferred from HOD 
modeling as described in £13.31 the statistical errors on these estimates are smaller than the points, and we omit the m for visual clarity. I n 
the left panel, the dotted curve is a fit to projected correlation functions in the 2dFGRS, b g /b* = 0.85 + 0.15L/L, (Norbcrg et al. 2001), 
where we take b, = 6 g (L») = 1.14 to be the bias factor inferred from the dark-matter-ratio estimate in the —21 < M r < —20 luminosity 
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bg/b* = 0.85 + 0.15L/L* - 0.04(M - M,) j ITegmark et al.H2004D . The solid curve is the fit in cq. {TD). In the right panel, the solid curve is 
the fit to the HOD model bias factors, eq. <[Qj . The points locations on the magnitude axis correspond to the bin center (left) and threshold 
magnitude (right). Small horizontal offsets have been added to points for clarity. 



the HOD modeling just accepts the \ 2 penalty of missing 
the large-scale data points. 

Other large structures in the survey volume may af- 
fect the cluste ri ng m easurements in more subtle ways. 
iMcBride et al.l (|2011l) investigated the impact of such 
structures on the 3-point correlation function by ana- 
lyzing the residuals of individual jackknife samples. The 
2-point correlation function is much less sensitive to such 
effects, with individual deviations at the few percent level 
at most scales (C. K. McBride, private communication). 
Of course, it is the cumulative impact of these residuals 
that determines the jackknife error bars, so the fact that 
they exist does not in itself imply that er rors are larger 
than o ur estimates. The tests of Z05 and iNorberg et aT] 
(2009) suggest that jackknife error estimates for w p (r p ) 
are typicallly accurate or conservative (see ^2.2|) . An in- 
vestigation of the individual residuals also suggests that 
the jackknife errors follow a Gaussian distribution to 
a fair approximation, but with some noticeable skew- 
ness (C. K. McBrid e, private communication; see also 
INorberg et al.l [20 1 If) . However, we find only a small 
difference between measuring w p (r p ) from the full data 
sample and taking the median of the jackknife w p (r p ) 
values in each separation bin, which suggests that any 
non-Gaussianity will have minimal impact on our clus- 
tering results. 

Additional cosmic variance concerns have to do with 
the relatively small volumes associated with the lowest 
luminosity samples we consider. Figure [8] can shed some 



light on this issue as well. Specifically, the bottom-right 
panel checks the sensitivity to the volume probed in the 
correlation functions measured for the —19 < M r < —18 
and —20 < M r < — 19 luminosity bins. The general 
agreement of the curves (calculated in the overlap vol- 
ume) to the respective sets of points (calculated for the 
full volume-limited sample), within the measured uncer- 
tainties, is reassuring. We perform similar tests with the 
magnitude-threshold samples, looking at the robustness 
of the clustering measurements of different samples when 
limiting to the inner small volumes associated with the 
fainter thresholds. Here we find substantial volume ef- 
fects when considering the volume of the faintest thresh- 
old sample M r < —18, but very weakened effects for the 
M r < —18.5 and brighter thresholds (see Figure [9]). We 
are unsure why the effects for the M r < —18.0 sample 
appear larger than those for the —19.0 < M r < —18.0 
luminosity-bin sample, as they have the same outer red- 
shift limit; however, the galaxy populations are different 
in the two cases, with the luminosity threshold sample 
including also brighter galaxies. We find these finite- 
volume effects to be smaller than those found in the ear- 
lier samples of Z05, due to the larger sky coverage of 
SDSS DR7. 

The comparisons of solid and dashed curves in Fig- 
ure [8] provide the fairest test of luminosity dependence 
between the samples, as the effects of cosmic variance are 
essentially removed by matching volumes. For the fainter 
samples shown in the lower panels, the evidence for hum- 
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Fig. 8. — Check of finite volume effects ("cosmic variance") in the measured luminosity dependence of the correlation function. Each 
panel shows projected correlation functions of two adjacent luminosity bins in their full volume-limited range (symbols with error bars) 
and in their common overlap regions (lines). The solid line and filled symbols correspond to the fainter luminosity bin in each panel, while 
the dashed line and open symbols correspond to the brighter sample. Comparison of the dashed and solid lines in each panel tests for 
luminosity segregation between the two adjacent bins measured in a common volume. 



nosity dependence is marginal relative to the error bars. 
The detection is stronger in the upper right panel and 
overwhelming for the brightest galaxies in the upper left. 
The difference between the dashed line and the open 
points in this panel is plausibly explained by the small 
sample (~ 2600 galaxies) of —23 < M r < —22 galax- 
ies in the overlap volume: the larger volume of the full 
sample is required to give a robust measurement of large- 
scale clustering for these rare galaxies. These conclusions 
— evidence for increased clustering at M r sa —21.5 and 
dramatically increased clustering at M r w —22.5 — are 
consistent with the b(L) data points in Figure [71 

3.3. Modeling the luminosity dependence 

To investigate further the implications of the 
luminosity-dependent clustering, we turn to HOD mod- 
eling. We find the best-fit HOD models for our set of 



volume-limited luminosity-threshold samples, using the 
five-parameter model described in § 12.31 Figure [TU] shows 
the HOD best fits to the projected correlation functions 
(staggered by 0.25 dex for clarity). Here we use the full 
volume-limited samples, with no attempt to remove the 
SGW. The values of the fitted parameters, inferred us- 
ing the full error covariance matrix, are given in Table |3l 
We also list / sa t, the fraction of sample galaxies that are 
satellites from the HOD modeling results. We see that 
the HOD models provide reasonable fits to the projected 
correlation functions, with deviations from a power-law 
more apparent for the brighter samples. The character- 
istic inflections in w p (r p ) at r p = 1 — Ihr 1 Mpc arise 
at the transition from the small-scale, one-halo regime, 
where most correlated pairs come from galaxies in the 
same halo, to the large-scale, two-halo regime, where 
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Fig. 9. — Check of finite volume effects in the nearby low- 
luminosity threshold samples. The plot shows the projected corre- 
lation functions of the three faintest threshold samples in their full 
volume-limited range (symbols with error bars) and limited to the 
smaller overlap region of the adjacent fainter sample (lines). When 
limiting the M r < —18.5 sample (or brighter ones) to the volume 
of the M r < — 18 sample (solid black line), the correlation func- 
tion changes significantly and agrees with the latter measurement, 
while no such effect is detected when limiting the M r < — 19 (or 
brighter samples) measurements to the volume of the M r < —18.5 
sample (dashed blue line). 



the shape of £(r) approximat ely traces the shape of the 
matter correlation function ()Berlind fc Weinberg] l2002t 
IZehavi et al.ll2004T ). The x 2 values for these fits are also 
specified in the table and can be compared to the corre- 
sponding values for the power-law fits (Table [2]). In all 
cases the HOD model has a better goodness-of-fit than 
the best-fit power law model. We note, however, that the 
X 2 values still tend to be somewhat large, particularly for 
the bright samples. These might reflect uncertainties in 
the jackknife error covariance estimate, residual system- 
atics or a limitation of the restricted HOD model. For 
the M r < —18.0 sample, the HOD model overpredicts 
the amplitude of w p (r p ) at large scales, but the tests in 
Figure |9] suggest that the large scale clustering of this 
sample is significantly affected by the small sample vol- 
ume. 

The high-mass slope a of the satellite mean occupa- 
tion function is around unity for most samples. For the 
brightest sample (M r < —22.0), a is noticeably higher 
than unity, but with large error bars. The right panel of 
Figure [10] presents the halo occupation functions them- 
selves. When going toward brighter samples, the main 
effect is a shift of the halo occupation function toward 
higher halo masses, a shift that affects both the central 
galaxy cutoff and the satellite occupation. More lumi- 
nous galaxies occupy more massive halos, which leads 
to their stronger clustering. For the six fainter sam- 
ples, there are models with sharp central-galaxy cut- 
offs (aiogju = 0) that have Ax 2 < 1 compared to the 
best-fit model; we have chosen to plot these sharp-cutoff 
(N(M h )} curves in Figure \M For the M r < -21.0, 
M r < —21.5, and M r < —22.0 samples, however, a 
non-zero value of <7i og M, indicating scatter between halo 



mass and central galaxy luminosity, is required to simul- 
taneously fit the galaxy number density and projected 
correlation function. A sharper cutoff would predict an 
excessive clustering amplitude for the measured number 
density because of the rising b(Mh) relation. Figure [TT] 
illustrates the level of statistical uncertainty in the HOD 
fits, plotting (N(Mh)) for ten models randomly chosen 
from the MCMC chain that have Ax 2 < 1 relative to 
the best-fit model for each of three luminosity thresh- 
olds. The cutoff profile is generally better constrained 
for brighter samples because of the steeper form of b(Mh) 
at high Mh- The satellite occupations are tightly con- 
strained in all cases (other than a relatively large scatter 
in Mo for the M r < —20.5 sample). 

Figure [T2b shows the two characteristic halo mass 
parameters M min and Mi (see § I2.3f) as a func- 
tion of the threshold luminosity. Both halo mass 
scales increase with the sample's threshold luminos- 
ity, with a steeper dependence for brighter galax- 
ies. Because central galaxies dominate th e total num- 
ber d ensity for any luminosity threshold (|Zheng et al.l 
2005), the approximate form of the M m [ n curve fol- 
lows simply from m atching the space densities of galax- 
ies and halos (e.g.. IConrov. Wechsler fc Kravtsovl 120061 : 
IVale fc Ostrikerl l2006f) . In our HOD parameterization, 
•^min can be interpreted as the mass of halos in which 
the median luminosity of central galaxies is equal to the 
threshold luminosity. We propose the following form for 
the relation between median ce ntral galaxy lum inosity 
L ccn and halo mass Mh (see also lKim et al.ll2008h . 

W*.-^)™«p(-£ + i). <") 

where A, M t , and an are three free parameters. That 
is, the median central galaxy luminosity has a power-law 
dependence on halo mass at the high mass end (with 
a power-law index of ocm) and drops exponentially at 
the low mass end. The transition halo mass is char- 
acterized by M t , and the normalization factor A is the 
median luminosity of central ga laxies (in units of L y = 
1.20 x 10 w h~ 2 L Q in the r-band; iBlanton et alJfeOOSd ) in 
halos of transition mass. The fit (solid curve) shown in 
FigureQjk has A = 0.32, M t = 3.08 x 10 11 /i- 1 M o , and 
ajvf = 0.264. Figure WZb shows the Mh/L cen ratio as a 
function of halo mass. The solid curve is derived from 
the fit in Figure [i~2"b: 



Mh _ ( Mh 



M t 



(Mh 
\Mt 



exp 



M_ 
M h 



1 



(12) 



where {M h /L ccn ) Mt = 8O/iAf /L is the mass-to-light 
ratio in halos of transition mass. The transition mass 
(times 1/(1 — aj.f), to be exact) also marks the scale 
at which Mh/L ccn reaches a minimum. Halos of Mh ~ 
4.2 x 10 11 H~ 1 Mq are maximally efficient at convert- 
ing their available baryons into r-band light of their 
central galaxy. Other authors have reached a similar 
conclusion usi ng HOD , CLF, or SHAM methods (e.g 



2005 



2008 



Yang. Mo. fc van de n Bosc h] l2003t iTinker et al. 
Vale fc Ostrikerl2006l:IZheng et aU2007l; lKim etafl 
Guo ct al. 2010; M oster et alll2010D . 

In Figure WBa , the sharp upturn in Mh (M m ; n ) arises 
because the galaxy luminosity function drops exponen- 
tially in a regime where the halo mass function remains 
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Fig. 10. — Luminosity dependence of galaxy clustering and the HOD. The left panel shows the measured w p (r p ) and the best-fit HOD 
models for all luminosity-threshold samples. The samples are each staggered by 0.25 dex, starting from the M r < —20.5 sample, for clarity. 
The right panel shows the corresponding halo occupation functions, (AT(Af^)), color-coded in the same way. The occupation functions shift 
to the right, toward more massive halos, as the luminosity threshold increases. The separation of central and satellite galaxies is shown 
for the rightmost occupation function, corresponding to the brightest sample, as the dashed and dotted curves, respectively. For the six 
fainter samples, we have chosen models with sharp central-galaxy cutoffs (a"i og m ~ 0) that have A\ < 1 relative to the best-fit model 
listed in Table \3\ (see text). The three brightest samples require smooth cutoff profiles to fit the number density and clustering data. 



TABLE 3 
HOD and Derived Parameters for Luminosity Threshold Samples 



M™ ax 


log M min 


"logM 


log M 


log M[ 


a 


log Mi 


/sat 


b 9 


x 2 

dof 


-22.0 


14.06 ±0.06 


0.71 ±0.07 


13.72 ±0.53 


14.80 ± 0.08 


1.35 ±0.49 


14.85 ±0.04 


0.04 ±0.01 


2.16 ±0.05 


1.8 


-21.5 


13.38 ±0.07 


0.69 ± 0.08 


13.35 ± 0.21 


14.20 ±0.07 


1.09 ±0.17 


14.29 ±0.04 


0.09 ±0.01 


1.67 ±0.03 


2.3 


-21.0 


12.78 ±0.10 


0.68 ±0.15 


12.71 ±0.26 


13.76 ± 0.05 


1.15 ±0.06 


13.80 ±0.03 


0.15 ±0.01 


1.40 ±0.03 


3.1 


-20.5 


12.14 ±0.03 


0.17 ±0.15 


11.62 ±0.72 


13.43 ± 0.04 


1.15 ±0.03 


13.44 ±0.03 


0.20 ±0.01 


1.29 ±0.01 


2.7 


-20.0 


11.83 ±0.03 


0.25 ±0.11 


12.35 ± 0.24 


12.98 ±0.07 


1.00 ±0.05 


13.08 ± 0.03 


0.22 ±0.01 


1.20 ±0.01 


2.1 


-19.5 


11.57 ±0.04 


0.17 ±0.13 


12.23 ±0.17 


12.75 ± 0.07 


0.99 ± 0.04 


12.87 ±0.03 


0.23 ±0.01 


1.14 ±0.01 


1.0 


-19.0 


11.45 ±0.04 


0.19 ±0.13 


9.77 ±1.41 


12.63 ± 0.04 


1.02 ±0.02 


12.64 ±0.04 


0.33 ±0.01 


1.12 ±0.01 


1.8 


-18.5 


11.33 ±0.07 


0.26 ±0.21 


8.99 ±1.33 


12.50 ±0.04 


1.02 ±0.03 


12.51 ±0.04 


0.34 ±0.02 


1.09 ±0.01 


0.9 


-18.0 


11.18 ±0.04 


0.19 ±0.17 


9.81 ±0.62 


12.42 ±0.05 


1.04 ±0.04 


12.43 ±0.05 


0.32 ±0.02 


1.07 ±0.01 


1.4 



Note. — See Eq. [7] for the HOD parameterization. Halo mass is in units of 1i~ 1 Mq. Error bars on the HOD parameters 
correspond to ltr, derived from the marginalized distributions. Mi, / sa t and b g are derived parameters from the fits. Mi is 
the mass scale of a halo that can on average host one satellite galaxy above the luminosity threshold and / sa t is the fraction of 
satellite galaxies in the sample. b g is the large-scale galaxy bias factor and is degenerate with the amplitude of matter clustering 
as, so that this is in fact b g X (as/0.8). A 2% systematic shift in the w p values would correspond to a 1% change in b g , effectively 
doubling the tiny errorbars on it. For all samples, the number of degrees-of-freedom (dof) is 9 (13 measured w p values plus the 
number density minus the five fitted parameters). The parameters of the sharp-cutoff models plotted in Fig llOl for the six fainter 
samples (see text) are specified hereby as (M™ ax , logM min , <r logM , logM , logMj, a): (-18.0, 11.14, 0.02, 9.84, 12.40, 1.04); 
(-18.5, 11.29, 0.03, 9.64, 12.48, 1.01); (-19.0, 11.44, 0.01, 10.31, 12.64, 1.03); (-19.5, 11.56, 0.003, 12.15, 12.79, 1.01); (-20.0, 11.78, 
0.02, 12.32, 12.98, 1.01); (-20.5, 12.11, 0.01, 11.86, 13.41, 1.13). 



close to a power-law. The sharp rise in b(L) (Fig- 
ure [7]) is driven both by this upturn in M/, (M m j n ) 
and by the steepening of the b(Mh) relation itself 
(IMo fe Whitd Il99a Ijihgl [l998t ISheth. Mo. fe Tormenl 
I200U iTinker et all I2010I ). As discussed by Zheng et 
al. (2009, Appendix A), the greater departures from 
a power-law w p (r p ) evident for brighter galaxies arise 
mainly because -M m j n and Mi are larger than the char- 
acteristic halo mass M^ where the halo mass function be- 
gins to drop exponentially; this change in the halo mass 
function shape leads to a sharper transition between the 
one-halo and two-halo regimes of the correlation func- 
tion. 
There is a considerable gap between the values of M m - m 



and Mi at all luminosities. As in earlier works, we find 
an approximate scaling relation of Mi ~ 17M m j n , imply- 
ing that a halo hosting two galaxies (one central galaxy 
and one satellite) above the luminosity threshold has to 
be about 17 times more massive on average than a halo 
hosting only one (central) galaxy above the luminosity 
threshold. Halos in this "hosting gap" mass range tend to 
host more luminous (higher mass) central galaxies rather 
than multiple galax ies, consistent with the predictions of 
iBerlind et al.l (|2003t ) based on hydrodynamic simulations 
and semi-analytic models. As can be seen in Figure [T2"b. 
this scaling factor is somewhat smaller at the high lu- 
minosity end, corresponding to massive halos that host 
rich groups or clusters. This latter trend likely reflects 
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Fig. 11.— Uncertainties in the HOD fits for the M r < -19.5, 
—21.0, and —21.5 luminosity-threshold samples (left to right). For 
each model, the figure shows (N(Mf l )) for ten randomly selected 
models that have Ax 2 < 1 relative to the best-fit model. 



the relatively late formation of these massive halos, which 
leaves less time for satellites to merge onto central galax- 
ies and thus lowers the satellite threshold Mi. Physical 
effects that sh a pe th e Mi/M m ; n relation are discussed by 
IZentner et al.1 (|2005t l using analytic descriptions of halo 
and galaxy merger rates. 

Our results are cons istent with previou s measurements 
of these trends (Z05; IZheng et al.ll2007l ). Z05 found a 
slightly larger scale factor of « 23, likely because of slight 
differences in the HOD parameterizations a nd the corre- 
spond ing definitions of the halo mass scales. IZheng et al.l 
(|2007| ) found, for that same early SDSS sample but us- 
ing the current HOD model, Mi « 18M m ; n , in excellent 
agreement with our results for the final SDSS sample. 
These results are also in agreement with predictions of 
galaxy formation models. In particular, the scale factor 
in the M m j n — Mi scaling relatio n is in good agreem ent 
with the predictions presented by IZheng et al.l ()20Q5f ) . 

Returning to Figure [7J open circles in the right-hand 
panel show the values of b g {> L) corresponding to our 
best-fit HOD models, i.e., the asymptotic bias on large 
scales where the bias factor is scale-independent. These 
bias estimates necessarily depend on the assumptions 
associated with our HOD modeling, principally that 
(JV(Mfi)) has the form defined by equation ([Jj and that 
(N(Mh)) is independent of a halo's large-scale environ- 
ment (no "assembly bias"). These assumptions allow us 
to use constraints from smaller scale clustering and the 
galaxy number density, greatly reducing the error bars 
on b g {> L) and reducing the sensitivity to cosmic vari- 
ance in the large-scale clustering. Given the significant 
finite-volume variations that remain even in SDSS DR7 
( §3.2)1 . we consider these HOD-based b g (> L) values to 
be our most robust estimates of the luminosity depen- 
dence of galaxy bias, despite their dependence on an as- 
sumed model. 

To obtain HOD-based bias factors for luminosity-bin 
samples, we have taken the central and satellite occupa- 
tion functions for each bin to be simply the difference 



of the occupation functions for the bracketing threshold 
samples, yielding the open circles in the left panel of Fig- 
ure [71 The right panel of Figure [T3] shows the resulting 
mean occupation functions, and the left panel compares 
the w p (r p ) predicted by these threshold-difference HODs 
to the observed w p (r p ) from Figure El After fitting the 
luminosity-threshold correlation functions, there are no 
parameter adjustments made to fit the luminosity-bin 
data. The agreement is generally good. For the faintest 
luminosity bin —19 < M r < —18 on large scales, the 
model slightly overpredicts the observed w p (r p ) (with 
X 2 = 20 for 13 data points). This tension could indi- 
cate that our HOD model does not allow a good de- 
scription of galaxies in this luminosity range, or it could 
be that our jackknife method underestimates the cosmic 
variance uncertainties for this small-volume sample. We 
have already noted that the HOD model of the bracket- 
ing M r < —18.0 sample overpredicts its observed large- 
scale clustering (FigurefTO]). and that this sample appears 
to have significant finite- volume effects (Figure [9]) . We 
revisit this overprediction in £14.51 below, where we sepa- 
rately examine the clustering of red and blue galaxies in 
this luminosity bin. 

4. DEPENDENCE ON COLOR 

4.1. Auto- Correlation of Blue and Red Galaxies 

The clustering of galaxies is known to depend on other 
galaxy properties in addition to luminosity, such as color, 
surface brightness and profile shapes. These latter quan- 
tities are correlated with e ach other and prod uce sim- 
ilar trends in w p (r p ) (e.g., IZehavi et al.l l2002h . More- 
over, galaxy luminosity and color have been shown to 
be the two properties most predictive of galaxy environ- 
ment (jBlanton et al.1 [200 5a). such that any residual de- 
pendence on morphology or surface brightness at fixed 
luminosity and color are weak. We focus in this section 
on the color dependence of galaxy clustering using our 
luminosity-bin samples. Figure[5]qualitatively illustrated 
the clustering differences between blue and red galax- 
ies. To study this difference quantitatively, we divide 
our sample into "blue" and "red" galaxies according to 
the well-kn own color bimodality in the color-magn itude 
plane (e.g. JStrateva et all2001llBaTdrv et al.ll2004j) . Fol- 
lowing Z05, we use a magnitude-dependent color cut de- 
fined by 

(<? ~ r)cut = 0.21 - 0.03M r . (13) 

This tilted cut, shown below in Figure [T8l appro- 
priately separates the red E/S0 ridgeline from the 
blue cloud, following the division into two populations 
as a functi on of luminosit y . An iden tical color cut 
is used by ISwanson et al.l (12008D and McBrid e et aP 
(1201 ll) while other wo rks (e.g.. lBlanton fc Berlindll2007t 
iSkibba fc~S heth 20091) use a very slightly modified divi- 
sion. Our results are not sensitive to the exact choice of 
the cut. 

While color most directly measures star formation his- 
tory, it can also be viewed as a proxy of morphology, 
where blue galaxies are mostly spirals, and red galax- 
ies tend to be spheroid dominated. (The two classifi- 
catio n schemes are certainly not identical, however; see , 
e.g.. IChoi. Park, k, Vogelev) l2007t iBamford et~all l2009t 
iBlanton fc Moustakasl 120091: ISkibba et al.l 1200911 Fig- 
ure [14] shows £(r>,7r) separately for blue and red galax- 
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Fig. 12. — Panel (a): characteristic mass scales of halos hosting central galaxies and satellites as a function of the sample threshold 
luminosity. Open symbols show the M m i n value s, w hile filled symbols are the Mi values. The solid curve is a simple parametrized fit to 
M m i n as a function of threshold luminosity (eg. 1111 ). The dotted curve denotes the solid curve scaled up by a factor of 17, representing 
the Mi fa 17M m ; n scaling relation. Panel (fe): ratio of halo mass to median central galaxy luminosity as a function of halo mass. The solid 
curve is derived from the fit in panel (a) (see eq. 1121 ). 
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Fig. 13. — HOD models and predictions for the projected correlation functio n of luminosity-bin samples. The HOD for each bin is set 
to the difference of the HODs for the bracketing luminosity thresholds (see Fig. llOII . with no further adjustments to fit the luminosity-bin 
data. The right panel shows (JV(Mj,)) separately for the central (dashed lines) and satellite (dotted lines) galaxies. The left panel shows 
the corresponding model predictions together with the w p (r p ) data from Fig. [6] The samples are each staggered by 0.5 dex, starting from 
the —21 < M r < —20 sample, for clarity. 



ies, for a representative case of the —20 < M r < — 19 
volume-limited sample. The difference between the two 
populations is striking. The red galaxies exhibit a sub- 
stantially higher clustering amplitude and much stronger 
fmger-of-God distortions on small scales, as seen in the 
elongation along the 7r direction for small r p separations. 
These differences reflect the expected color-density re- 
lation, with red galaxies residing in more massive ha- 
los that have a stronger bias and higher velocity disper- 
sions. The large-scale coherent distortion is more appar- 
ent in the blue sample. In linear theory, the c oherent 
disto r tion depends on the parameter /3 ~ Q^/b (|Kaiserl 
Il987t iHamiltonl 119981) , as a lower bias implies a larger 



gravitational perturbation for a given galaxy overdensity. 
The blue galaxies are better tracers of the 'field' and are 
less biased and thus exhibit a stronger large-scale com- 
pression and only a weak fmger-of-God distortion. The 
£(r p ,7r) diagram of the full —20 < M r < —19 sample 
(Figure [5]) is, of course, intermediate between these two. 
These results are in qualitative agreement with previous 
SDSS measure ments (Z05) and with DEEP2 measure- 
ments at z ~ 1 (|Coilet al.ll2008t) . 

Figure [15] shows the corresponding projected correla- 
tion functions, w p (r p ). The correlation function for the 
red sample has a higher amplitude and steeper slope 
than the blue sample. Fitting power laws results in 
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Fig. 14. — Contours of the galaxy correlation function as a function of tangential separation r p and line-of-sight separation n for the 
-20 < M r < — 19 sample, evaluated separately for red galaxies (left) and blue galaxies (right). Contours are the same as in Fig. [5] 



a correlation length of r$ = 6.63 ± 0.41 h 1 Mpc and 
slope 7 = 1.94 ± 0.03 for the red galaxies, versus ro = 
3.62 ± 0.15 h- 1 Mpc and 7 = 1.66 ±0.03 for the blue (see 
Table |3| . These trends are similar for all the luminosity 
samples, but the differences in clustering are weaker with 
increasing luminosity, as is shown in § 14.31 

4.2. Cross- Correlation of Blue and Red Galaxies 
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Fig. 15. — Projected correlation functions of red and blue galax- 
ies in the —20 < M r < —19 luminosity bin. Red triangles and 
blue squares show the auto-correlation functions of the red and 
blue subsamples, respectively, while open black circles show the 
auto-correlation of the full sample. Filled green circles show the 
projected cross-correlation function of the red and blue galaxies. 
The black solid line shows the geometric mean of the red and blue 
auto-correlations for comparison. 



The auto-correlation functions of red and blue galaxies 
separately do not include blue-red galaxy pairs. Another 
useful measurement is then the cross-correlation between 
blue and red galaxies. In the large-scale, linear bias ap- 
proximation, where <5 rcd = b rc d$ m and <5 b iuc = &biue<5m, 
the cross-correlation must be the geometric mean of the 
auto-correlations. To the extent that halos have corre- 
lation coefficient r = Oim/V^mmUih — 1 with the mat- 
ter distribution, the geometric mean result should hold 
throughout the two-halo regime, even if the halo bias is 
scale-dependent and the matter field is non-linear. On 
small scales, in the one-halo regime, the cross-correlation 
encodes information on the mixing of galaxy populations 
within the halos. Any tendency of red or blue galaxies to 
segregate from one another will be reflected as a devia- 
tion of the cross-correlation function from the geometric 
mean. For example, if some halos contained only red 
galaxies while other halos contained only blue galaxies, 
this would depress the number of one- halo pairs and push 
the cross-correlation function below the geometric mean. 
However, the prediction of the cross-correlation function 
has a number of subtleties; we discuss these issues and 
provide some simplified estimates in Appendix [A] We 
also show in this Appendix that the cross-correlation 
function of two galaxy populations is mathematically 
determined if one knows the auto-correlation of the in- 
dividual populations and of the combined population; 
nonetheless, the cross-correlation presents this implicit 
information in a more intuitive form. 

We measure the cross-correlation function of the blue 
and red galaxy samples in an analogous way to the auto- 
correlations, using the Landy-Szalay estimator. Specif- 
ically, we use equation ^ with D\D^ replacing DD, 
R1R2 replacing RR and D1R2 + -D2-R1 replacing 2DR, 
with the subscripts denoting the two cross-correlated 
subsamples. Error bars are obtained similarly via jack- 
knife resampling. Filled green circles in Figure [15] show 
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the resulting cross-correlation function for the —20 < 
M r < — 19 sample. On large scales, as expected, we 
find that the cross-correlation result follows the geomet- 
ric mean of the blue and red auto-correlations. On small 
scales (for r p < Ihr 1 Mpc) we find that the cross- 
correlation falls below the geometric mean, possibly indi- 
cating a slight segregation of blue and red galaxies within 
the halos. This deviation is significant given the small er- 
ror bars on these scales. Note, however, that this is very 
far from suggesting a full segregation into "red halos" 
and "blue halos". That extreme case would lead to no 
one-halo contribution at all, making the projected cross 
correlation approximately flat for r p < 2 /i -1 Mpc. 

We find similar behavior for the cross-correlation of 
red and blue galaxies in all of our luminosity subsam- 
ples. However, the depression of the cross-correlation 
below the geometric mean is stronger for the relatively 
faint samples and smaller for brighter galaxies (consis- 
tent with Z05, who showed the cross-correlation function 
for an M r < —21 gala xy samp l e). O ur results are also 
in agreement with Wa ng et al.l (|2007l ). who investigated 
in detail the cross-correlation between galaxies of differ- 
ent lumin osities and color u sing a n earlier SDSS sample, 
and with iRoss k, Brunnerl ([2009D , who measured angu- 
lar clustering of an SDSS photometric sample. A similar 
depression of the cr oss-correlat i on be low the geometric 
mean is observed bv lCoil et al.1 ( 20081). 

Using an SDSS group catalog. fWeinmann et al.l ([2006) 



find that the colors of satellite galaxies are correlated 
with those of their central galaxy. However, this trend, 
which they termed "galactic conformity", is found to 
have roughly the same strength independent of luminos- 
ity, so its connection to our findings is unclear. It is 
known that the fraction of red galaxies that are satel- 
lites becomes larger with decreasing luminosity (e.g., 
Z05; see also related discussions in the following sub- 
sections). Thus, the luminosity-dependent suppression 
of the cross-correlation function in the one-halo regime 
may be simply related to the relative paucity of blue 
gala xies co mpared to red on es within large halos (see 
also Ivan den Bosch et al . 2008b; Ha nsen et al.l 120091 ) . In 
future work, we will model the cross-correlation results 
with HOD in detail, and study the implication of these 
measurements for the distribution of red and blue galax- 
ies within dark matter halos. 

4.3. Joint Dependence on Color and Luminosity 

We now turn to the luminosity dependence of clus- 
tering within the red and blue galaxy populations indi- 
vidually, using the luminosity-dependent color division 
of equation (fT"3]). Figure [TBI shows projected correlation 
functions for the volume-limited luminosity-bin samples, 
separately for the red (left panel) and blue (right panel) 
galaxies. Figure [17] shows the correlation length r$ and 
slope 7 of power-law fits to these samples. Because some 
of the samples are quite small, making jackknife esti- 
mates of the covariance matrix noisy, we fit using the 
diagonal error bars only, which is enough to capture the 
trends visible in the w p (r p ) plots. Figure Q7] also shows ro 
and 7 from diagonal fits to the full luminosity-bin sam- 
ples. The differences between the different color samples 
are particularly distinct for the fainter samples, and they 
decrease with increasing luminosity. 

These plots display the same general trends seen in 



previous sections: the large-scale clustering amplitude 
increases with luminosity for both red and blue popula- 
tions, and red galaxies generically have higher clustering 
amplitude and a steeper correlation function. Within the 
individual populations, however, the luminosity trends 
are remarkably different. The projected correlation func- 
tions of the blue galaxies are all roughly parallel, with 
slopes 1.6 < 7 < 1.8, and the amplitude (or correla- 
tion length) increases steadily with luminosity. For the 
red galaxies, on the other hand, the shape of w p (r p ) is 
radically different for the two faintest samples, — 18 < 
M r < — 17 and — 19 < M r < —18, with a strong inflec- 
tion at r p ss 3 h~ l Mpc indicating a high-amplitude one- 
halo term. These two samples have the strongest small- 
scale clustering, matched only by the ultra-luminous, 
—23 < M r < —22 galaxies. The large-scale clustering (at 
r p w 5 — 10/i _1 Mpc) shows no clear luminosity depen- 
dence until the sharp jump at the —23 < M r < —22 bin, 
though it is consistent with a weak but continuous trend 
at lower luminosities. Power-law fits yield significantly 
steeper power-law slopes for the correlation functions of 
the two faintest samples, together with a mild increase 
in the correlation lengths (see Fig.[l7|). We caution that 
the — 18 < M r < — 17 sample is very small, containing 
only about 5000 blue galaxies and 1000 red galaxies, and 
might be sensitive to cosmic variance; we have not used 
it in earlier sections but include it here to show the ex- 
tension of the luminosity trends to the faintest galaxies 
we can effectively study. 

The strong clustering of intrinsica lly faint red 
galax i es has been previ ously ob served dNorberg et all 
2001 iHogg et all l200l Z05; iSwanson et all 120081: 



Cresswell fc Percivalll2009h . We build on these studies, 



confirming this intriguing clustering signal and present- 
ing its most significant measurement obtained with the 
largest redshift sample available. The red galaxy samples 
analyzed here include ~ 25, 000 galaxies below L„, about 
6, 000 of them in the two faintest bins, more than triple 
the size of the samples studied in Z05. The strong clus- 
tering is an indication that most of t he faint red ga laxies 
are s atellites in fairly m assive halos (Bcr lind et all 2005; 
Z05; IWanget al]|2009h . We present HOD models of a 
few of these samples in 8 34.51 below but defer a detailed 
examination of this population to future work. 

4.4. Auto-Correlation of Finer Color Samples 

The large size of the DR7 main galaxy sample al- 
lows us to measure w p {r p ) for narrow bins of color in 
addition to the broad "blue" and "red" classifications 
used in !j4.H -i |4.3l and in most earlier work. Figure [18] 
shows the cuts we adopt to divide galaxies into "bluest" , 
"bluer" , "redder" and "reddest" populations. We also 
define an intermediate population of "green" galaxies, 
located near the minimum of the observed color bimodal- 
ity along the red/blue dividing line, ass ociated with 
the so-called "green v alley" galaxies (e.g., IWvder et all 
I2007t lLoh et allfeOlOf) . We include all galaxies within 
A(g - r) = 0.05 of the tilted dividing line of Eq. [JJ 
(analogous to t he "green" galaxy population studied by 
ICoil et al.l2008l ) . In addition, we add a sample of galaxies 
along the cusp of the red sequence galaxies, denoted as 
"redseq" , defined as all galaxies within A(<? — r) = 0.03 of 
the redder/reddest dividing line. Note that the last two 
classes are not distinct populations: the "green" sample 



Galaxy Clustering in the Completed SDSS Redshift Survey 



21 





; 1 1 


1 ' : 




- 1 






RED - 


1000 


— 


\ 








— 


o 


" ^^S?^ ^^S 


N 




; 


S 


- ^*nK\ 






_ 


Ifl 100 


r ^^ 




- 


a 
U 


: -23<M r <-22 t^ 
I -22<M r <-21 


s 




: 


* 10 


_-21<M r <-20 
E -20<M r <-19 






C\"= 




: - 19<M r <-18 


" VPv 




- -18<M r <-17 


% " 



0.1 





: 


M i i ' 


- 






BLUE 


- 


^ 1000 

o 
a 






! 


£. 10 ° 

a 

a 

^ 10 






; 










\i 



10 



0.1 



10 



r p (h 1 Mpc) 



r p (h ' Mpc) 



Fig. 16. — Projected correlation functions for different luminosity-bin samples, shown separately for red galaxies (left) and blue galaxies 
(right). For clarity, the brightest and faintest blue samples have been omitted from the plot, as their correlation functions are noisy. 
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Fig. 17. — Luminosity and color dependence of the galaxy correlation function. The plots show the correlation lengths (left) and 
slopes (right) corresponding to the real-space correlation function obtained from power-law fits to projected correlation functions using the 
diagonal errors. These are shown for the blue, red and full populations of the luminosity-bin samples. Points are plotted at the luminosity 
of the bin center, divided by L*, which is taken to be M r = —20.5. 



contains a subset of the redder and bluer samples, while 
the "redseq" sample contains a subset of the the redder 
and reddest samples. Details of the individual samples 
are given in Table |U 

Figure [19] shows the projected correlation functions of 
all these color samples, for the representative luminos- 
ity bin — 20 < M r < —19. We find a continuous trend 
with color, in both amplitude and slope: the redder the 
color of the sample, the higher and steeper the correla- 
tion function. We find the same trends in the other lumi- 



nosity bins, although the dependence on color is weaker 
at higher luminosities, as seen already for the red/blue 
division in Figure 1171 

Differences in clustering strengths should be reflective 
of the different environments of the galaxies. The steady 
trend of w p (r p ) with col or at fixed lumino sity is consis- 
tent with the findings of iHogg et al.l (|2003f l. who inves- 
tigated the density of galaxy environments as a func- 
tion of luminosity and color. The trend across our three 
red samples indicates that redder galaxies within the 
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Fig. 18. — A color-magnitude diagram for the SDSS galaxies, 
showing r-band absolute magnitudes vs. g — r colors. A random 
subset of the galaxies is plotted, sparsely sampled by a factor 10. 
The tilted lines denote the different color samples used. The solid 
lines denote the division into "bluest" , "bluer" , "redder" , and "red- 
dest" subsamples, respectively with increasing color. The dashed 
lines mark the boundary o f th e "green" population along the main 
red- blue dividing line (Eq, ll3H . The dotted lines indicate the "red- 
seq" galaxy population along the locus of the red sequence. The 
latter two populations are not independent of the previous ones: 
The "green" galaxies include some of the "bluer" and "redder" 
galaxies. Similarly, the red sequence "redseq" population is com- 
prised of some of the "redder" and "reddest" galaxies. 



TABLE 4 

Color Subsets of the Volume-Limited —20 < M T < —19 

Sample 



Sample 


^gal 


it 


ro 


7 


2 
X 
dof 


All 


44,348 


1.004 


4.89 ± 0.26 


1.78 ± 0.02 


3.79 


Red 


18,892 


0.428 


6.63 ± 0.41 


1.94 ± 0.03 


5.07 


Blue 


25,455 


0.576 


3.62 ± 0.15 


1.66 ± 0.03 


1.66 


Reddest 


10,278 


0.233 


7.62 ± 0.42 


2.07 ± 0.03 


1.87 


Redseq 


7,542 


0.171 


7.23 ± 0.28 


1.95 ± 0.03 


1.06 


Redder 


8,614 


0.195 


5.48 ± 0.43 


1.91 ± 0.04 


1.84 


Green 


5,543 


0.126 


5.06 ± 0.42 


1.79 ± 0.05 


1.35 


Bluer 


11,156 


0.253 


4.14 ± 0.21 


1.69 ± 0.04 


0.89 


Bluest 


14,299 


0.324 


3.15 ± 0.15 


1.71 ± 0.05 


1.10 



Note. -— All samples use 14.5 < m r < 17.6. cz m i n = 
8,050km s~ x and cz max = 19,250km s _1 . fi is measured in 
units of 10~ 2 h 3 Mpc -3 . The number of degrees-of-freedom 
(dof) is 9 (11 measured w p values minus the two fitted param- 
eters). The subsamples are defined using tilted color cuts as 
described in the text. 



red sequence populate denser regions, ag ain consistent 
with lHogg et~aII(|2003H . iHogg et all (|2004f i examined the 
color-magnitude diagram as a function of environment 
and did not find a significant shift of the red sequence lo- 
cation with density, but examining their results in detail 
does reveal mild changes in the l ocus for bulg e-dominated 
galaxies. The trends observed in lHogg et atl (J2003ll200l 
a re subtle, but the y appear consistent with our results. 
iCoil et ail (|2008l) have carried out an analysis similar 
to ours at z ~ 1, using projected correlation functions 
of fine color bins in the DEEP2 galaxy survey. They 
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Fig. 19. — Projected correlation functions for various color sub- 
samples of the —20 < M r < — 19 volume-limit ed s ample. Color 
cuts are as defined in the text and shown in Fig. 1181 



find qualitatively similar results for blue galaxies and for 
the difference between blue and red galaxy clustering, 
but they find no significant change in the amplitude or 
slope of w p (r p ) among their red samples (see their fig- 
ure 12). The difference from our results could be a conse- 
quence of details of sample definition, or possibly a con- 
scque nce of color-depen dent inco mpleteness i n D EEP2 
(e.g., iGerkeet all 120071) . though ICoil et all (|2008t ) ac- 
count for this in their analysis. The difference could also 
be an evolutionary effect reflecting the buildup of galax- 
ies on the red sequence. One plausible explanation is that 
variations in star formation history and dust content con- 
taminate and scatter galaxies within the red sequence, 
and from the "blue cloud" into the red sequence , when 
the universe is younger fe.g. jBrammer et al.ll2009l) . while 
evolution to z = allows galaxy populations to separate 
more cleanly, yielding a tighter correlation between color, 
stellar population age, and environment. 

The clustering of the green galaxies falls between that 
of the blue and red galaxy samples and clearly follows 
the continuous trend with color in both amplitude and 
slope. We do not find the apparent break in the green 
galaxies' project ed correlation function seen in DEEP2 
(|Coil et aT1l2008|) , where the clustering amplitude is sim- 
ilar to that of blue galaxies on s mall scales and to that of 
the red galaxies on large scales. lLoh et all (J2010I ) inves- 
tigate the clustering properties of "green valley" galaxies 
using UV imaging from GALEX matched to SDSS spec- 
troscopy, and find that the clustering of green galaxies is 
intermediate between that of the blue and red galaxies, 
in qualitative agreement with our results. However, they 
find that the green galaxies have a large-scale clustering 
ampli tude similar to that of the blue galaxies (in contrast 
with lCoil et al.ll2008f ). When fitting an overall power- law 
to the projected correlation function, they find the green 
galaxies' clustering amplitude to be between that of the 
blue and red samples, with a similar slope to that of the 
red galaxies, while we find the green galaxy population 
to be intermediate in both amplitude and slope. These 
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differences may have to do with the different sample def- 
inition and selection in each of these and warrant further 
investigation. 

4.5. Modeling the Color Dependence 

To model the color dependence of w p (r p ) presented in 
£14.41 we adopt a simplified HOD model based on the pa- 
rameterized form of the mean occupation function spec- 
ified in equation (jjj) for luminosity-threshold samples. 
For the —20 < M r < — 19 luminosity bin, we set the 
central galaxy occupation function to the difference of 
the M r < —19 and M r < — 20 modeling results shown 
in § 13.31 For simplicity, we also fix the slope of the 
satellite occupation function, a, to 1. We also assume 
that the occupation number of satellites at fixed halo 
mass follows a Poisson distribution and is independent 
of the central galaxy occupation number. The model- 
ing is thus a one-parameter family, in which only M[ 
is varied to fit w p {r p ), changing the relative normaliza- 
tion of the central and satellite occupation functions with 
color. The overall normalization is determined by match- 
ing the observed number density of galaxies in the color 
bin. In this simple model, the relative fraction of blue 
and red satellites has no dependence on halo mass. Dif- 
ferent modeling approaches and more detailed parame- 
terizations ar e poss ible, of course (e.g., [S cranton 2002; 
Cooravl 120051: Z05: iRoss fc Brunnerl l2009t ISimon et al.1 
2009; Skib ba fc Shethll2009D . but this form is sufficient 
to explain the main trends of the color dependence. We 
note that our model guarantees that the sum of central 
galaxy occupation functions of independent color sam- 
ples equals that of the full — 20 < M r < — 19 bin sample. 
By construction, the sum of the satellite mean occupa- 
tion functions, each of which follows a power law with 
soft cutoff, differs in shape slightly from the bin-sample 
satellite occupation, which is the difference of two power 
law curves with soft cutoffs. We have verified, however, 
that the sum in our fits is close to the satellite mean oc- 
cupation function of the overall bin sample, especially in 
the range where the occupation number is close to unity 
and the contribution to the small scale clustering signal 
is dominant. 

Figure [20] presents the results of this modeling. Points 
with error bars in the upper-left panel are the w p (r p ) 
measurements for fine color bins repeated from Figure [19l 
with 0.25-dcx offsets added between bins for visual clar- 
ity. The curves show the model predictions correspond- 
ing to the best-fit HODs, exhibited in the upper-right 
panel. Going from bluer galaxies to redder galaxies, 
the number of central galaxies steadily decreases and 
the number of satellite galaxies steadily increases. Al- 
though the central-to-satellite ratio is the only tunable 
parameter in our simplified HOD model, this is sufficient 
to explain the main trends observed in Figure [19] going 
from bluer to redder galaxies, the large-scale amplitude of 
w p (r p ) increases, the correlation function steepens, and 
the inflection at the one-to-two-halo transition becomes 
stronger. Table [5] lists the best-fit HOD parameters and 
X 2 values. We find x 2 /d.o.f. of 0.5 — 1.3 for most of 
the color samples, the exception being the bluest sam- 
ple, which has x 2 /d.o.f.^ 2. The fits can be improved by 
adding flexibility to the HOD model; for example, the fit 
for the bluest galaxies can be improved by allowing the 
slope of the satellite occupation function and the halo 



concentration to change. 

The lower panels of Figure [20] display the trends of 
satellite fraction more clearly. In the lower-right panel, 
we scale each occupation function by a constant factor so 
that the central galaxy components have the same nor- 
malization. The amplitude of the satellite occupation 
function increases steadily going from the bluest galax- 
ies to the reddest galaxies. The lower-left panel plots the 
satellite fraction / sat of each color bin against the median 
halo mass of galaxies in that bin. The satellite fraction 
rises from ~ 15% for the bluest bin to ~ 75% for the 
reddest bin, and the median halo mass increases as the 
fraction of galaxies that are satellites in massive halos 
grows. Green-valley galaxies have occupation functions 
intermediate between the red and blue galaxies, consis- 
tent with the idea that they are a transitio nal population 
(e.g., ICoil et ah 2008; Martin et al.1l2007l ). 

As discussed in ^3.31 the trend of clustering strength 
with luminosity is explained principally by a rise in the 
central galaxy halo mass, and the satellite fraction drops 
with increasing luminosity because the halo mass func- 
tion steepens at higher masses. In contrast, the trend 
with color at fixed luminosity can be explained with a 
constant halo mass for central galaxies and a steady in- 
crease of satellite fraction with redder color. The in- 
crease in typical host halo mass leads to an increase in 
the large-scale bias factor and thus a higher clustering 
amplitude at large scales. However, increasing / sat drives 
the one-halo term up more rapidly than the bias factor, 
so the correlation function steepens for redder galaxies 
as well. The success of our simple HOD model does not 
rule out a shift in central-galaxy halo mass for redder 
galaxies, but explaining the strong observed color trend 
solely through the central galaxy occupation would re- 
quire placing moderate luminosity red galaxies at the 
centers of very massive halos, and it might well be im- 
possible to match the clustering and number density con- 
straints simultaneously. 

Returning to the joint dependence on color and lumi- 
nosity ( J4.3[) . Figure [21] presents HOD model fits to the 
blue and red galaxy populations for three of the luminos- 
ity bins shown in Figure 1161 We use the same modeling 
approach adopted above for the fine color bins: we dif- 
ference the central galaxy occupation functions of two 
luminosity-threshold samples to get the central galaxy 
occupation function of the luminosity bin, fix the satel- 
lite slope to a = 1, and vary only the relative central 
and satellite normalizations within each population to fit 
the red and blue w p (r p ) measurements. With the other 
HOD parameters fixed previously by fitting the correla- 
tion functions of the full luminosity-threshold samples, 
this modeling has just one adjustable parameter within 
each luminosity bin. The model explains the rather com- 
plex color-luminosity trends from Figure [16] fairly well. 
In particular, it is able to reproduce the small-scale clus- 
tering of red galaxies increasing towards low luminosities, 
both in absolute terms and relative to the large scale 
clustering, while the shape of w p (r p ) for the blue galax- 
ies stays roughly constant. The fraction of red galaxies 
that are satellites increases sharply with decreasing lu- 
minosity, from 33% to 60% to 90% in the three luminos- 
ity bins, while the fraction of blue satellites (13%, 19%, 
19%) is smaller and only weakly dependent on luminos- 
ity. The precise values of the satellite fractions depend 
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Fig. 20. — HOD models of the correlation function in fine color bins of the —20 < M r < — 19 sample. See text for description of the HOD 
modeling. The top-left panel shows the measured w p (r p ) and the best-fit HOD models. Offsets of 0.25 dex are added for visual clarity, 
with the bluest galaxies at the bottom. The top-right panel presents the corresponding mean occupation functions, (iV^M;,)), color-coded 
in the same way, with dashed and dotted lines showing contributions of central and satellite galaxies, respectively. The bottom-right panel 
shows the same halo occupation functions normalized so that their central galaxy occupation functions coincide. The bottom-left panel 
shows the satellite fraction versus median halo mass for these color subsamples. Each colored "streak" shows results for models acceptable 
at the Ax 2 < 1 level; since the models have only one adjustable parameter, the uncertainty in this parameter produces a one-dimensional 
locus in this two-dimensional plane. 



on the HOD parameterization used to fit w p (r p ), but the 
general trend is robust: most blue galaxies at these lu- 
minosities are central, and the satellite fraction for red 
galaxies is higher and increases towards faint luminosi- 
ties. 

The largest quantitative failure of this model is its over- 
prediction of the large-scale w p (r p ) for the faintest red 
galaxies (and, to a smaller extent, for the faintest blue 
galaxies). It could be that our jackknife method under- 
estimates the errors for this small-volume sample, and 
we have already noted (Figure [T3|) that our HOD model 
overpredicts the total (red+blue) galaxy correlation func- 
tion in this luminosity bin. However, this discrepancy 
could indicate a limitation of our restricted HOD pa- 



rameterization. To investigate this possibility, we have 
considered models for the faint red galaxy population in 
which we vary the satellite slope a, the concentration pa- 
rameter of red galaxies in halos, and, most notably, the 
satellite cutoff parameter Mq in equation ([7]) . 

The dot-dashed curves in the upper panels of Fig- 
ure [5T] show an example in which red satellites arise only 
in halos above 10 13 h~ 1 MQ, reducing the satellite frac- 
tion from 90% in our original fit to 34%, thereby low- 
ering the large-scale bias factor. The physical motiva- 
tion for such a model is that gas accretion (and sub- 
sequent star formation) by a satellite system might be 
shut off only if it enters a halo whose mass is much 
larger than the "birth" halo in which it was a cen- 
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TABLE 5 

HOD and Derived Parameters for —20 < M r < — 19 

Fine-Color Subsamples 



Sample 


log M[ 


/norm 


/sat 


log M mcd 


x 2 

dof 


Reddest 


12.11 ±0.06 


0.10 


0.75 ± 0.03 


12.61 ±0.07 


1.3 


Redseq 


12.39 ±0.05 


0.11 


0.62 ± 0.03 


12.15 ±0.09 


1.2 


Redder 


12.67 ±0.04 


0.18 


0.46 ± 0.02 


11.80 ±0.02 


1.0 


Green 


12.87 ±0.05 


0.14 


0.34 ± 0.02 


11.70 ±0.01 


0.9 


Bluer 


13.11 ±0.03 


0.33 


0.24 ± 0.01 


11.65 ±0.01 


0.5 


Bluest 


13.36 ±0.05 


0.47 


0.15 ±0.01 


11.62 ±0.01 


2.0 



Note. — The shape of the mean occupation function for central 
galaxies is assumed to be the difference of those of M r < — 19 and 
M r < —20 samples. The mean occupation function for satellites 
follows a modified power law. The relative normalization of the mean 
occupation functions for central and satellite galaxies is determined 
by M [ . The overall normalization /norm is obtained from matching 
the observed number density (see text). Halo mass is in units of 
h~ 1 MQ. The satellite fraction, / sa t, and the median mass of host 
halos, M me( j, are derived parameters. For all samples, the number 
of degrees-of-freedom (dof) is 12 (13 measured w p values minus one 
fitted parameter). 



tral galaxy (ISimha et al J 120091: see alsolFont et all 120081 : 
iKang fc van den Bosch! 120081: Iskibbal |2009|). The lower 
satellite fracti on of this model i s more consistent with 
the results of IWang et al.l (|2009f ). who argue, based on 
group catalogs, that 30-60% of faint red galaxies (signifi- 
cantly fainter than those modeled here) are central rather 
than satellite galaxies. The fit to the smallest scale data 
points is impro yed by increasing the galax y concentra- 
tion parameter (|van den Bosch et al.ll2008al ) to twice the 
dark matter value, steepening the profile of the one-halo 
term. Visually, the w p (r p ) prediction of this model is 
not much better than that of the original model, but 
the x 2 /dof drops from 3.3 to 1.4 (since the mid-range 
points where the deviation is largest are the most covari- 
ant), a large statistical improvement. Despite its flexi- 
bility, this model underpredicts w p {r p ) in the one- halo 
regime and overpredicts it in the two-halo regime, em- 
phasizing how difficult it is to simultaneously reproduce 
the strong small-scale clustering and low large-scale bias 
factor of this galaxy sample. This tension could be a 
sign of environment-dependent effects on the HOD, but 
the expected form of "assembly bias" for low mass ha- 
los, putting the redder central galaxies into older, more 
clustered halos, would exacerbate the discrepancies with 
the data further. 

Overall, our inferences from HOD modeling accord well 
with the theoretical predictions of Berlind et al.l ([2005), 
who compared the r esults of cosmological SPH simula- 
tions to Hogg et al.'s (|2003l ) measurements of galaxy envi- 
ronmen ts as a function of lu minosity and color. In par- 
ticular, iBerlind et al.l (|2005[ ) find that the environment 
of satellite galaxies in the simulations is strongly corre- 
lated with stellar population age (hence color), and that 
for low and intermediate luminosities the environmen- 
tal dependence of the ov erall galaxy p o pulat ion tracks 
that of satellite galaxies. IBerlind et al.l (|2005h also find 
that the great majority of faint red galaxies in the sim- 
ulation are satellites, though the simulation they use to 
study this population is small. The success of our simple 
HOD models in reproducing the observed color depen- 
dence of w p (r p ) contrasts with the recent conclusions of 



iRoss fc Brunn er (2009), who find that some segregation 
of early- and late-type galaxies into separate halos is re- 
quired to reproduce their measured angular clustering of 
an SDSS photometric galaxy sample, which extends to 
smaller separations. 

5. CONCLUSIONS AND PROSPECTS 

The SDSS galaxy redshift survey allows high-precision 
clustering measurements for a broadly selected galaxy 
sample with extensive, high quality photometric informa- 
tion. We have examined the luminosity and color depen- 
dence of the galaxy correlation function in the DR7 main 
galaxy sample, which includes approximately 700,000 
galaxies over 8000 deg 2 , with a median redshift of ~ 0.1. 
This is the largest sample used to date for such stud- 
ies, by a factor of several. Furthermore, the DR7 main 
galaxy sample is likely to remain the definitive low red- 
shift galaxy survey for many years; other ongoing and 
planned s urveys, including th e BOSS survey of LRGs in 
SDSS-III (|Schiegel et al.ll2009r i. will probe larger volumes 
and higher redshifts, but they will not target a wide range 
of galaxy types in the present-day universe. Our anal- 
ysis focuses on the projected auto-correlation functions 
calculated for volume-limited samples defined by lumi- 
nosity and color cuts, with measurements tabulated in 
Appendix [C] We use HOD modeling to interpret these 
measurements in terms of the relation between galaxies 
and dark matter halos, assuming a ACDM cosmological 
model with Q m = 1 - A = 0.25, Q b = 0.045, h = 0.7, 
n s = 0.95, and cts = 0.8. 

The amplitude of w p (r p ) increases with increasing 
galaxy luminosity, slowly for L < L* and rapidly for 
L > L t , where £» corresponds to M r — —20.44 
(|Blanton et alj|2003d : we quote absolute magnitudes for 
h= 1 throughout the paper). For L < L*, w p (r p ) is rea- 
sonably described by a power-law at r p < 10 h^ 1 Mpc, 
while brighter samples show clear and increasingly strong 
inflections at r p w 1 — 3 h^ 1 Mpc. We find similar trends 
for samples defined by luminosity bins and by luminos- 
ity thresholds. The large-scale bias factor of luminosity- 
threshold samples is well described by the fitting formula 
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Fig. 21. — HOD model fits to the project ed c orrelation functions of red and blue galaxy populations in three luminosity bins, as labeled. 
Points with error bars are taken from Fig. 1161 Solid curves in the left hand panels show w p (r p ) for the best-fitting models. In the right 
hand panels, dashed and dotted curves show the mean occupation functions for central and satellite galaxies in the red (thick line) and 
blue (thin line) populations. In the upper panels, dot-dashed lines indicate an alternative fit (in which more parameters are varied) for the 
faint red population. See text for details of the modeling procedure. 



b g {> L) = 1.06 + 0.21(L/L*) 112 . For luminosity-bin 
samples, we find b g (L) = 0.97 + O.tfjL /L*) 1 - 04 , simi- 
lar to the luminosity dependence found bv iNorberg et all 
(|2001l ) for 6j-selected galaxies in the 2dFGRS. 

At fixed luminosity, the redshift-space correlation func- 
tion of red galaxies exhibits stronger "finger-of-God" 
distortions than that of blue galaxies, while the blue 
galaxies exhibit stronger large-scale, coherent flow distor- 
tions. The projected correlation function of red galaxies 
is steeper and higher in amplitude. The cross-correlation 
of red and blue galaxies is equal to the geometric mean of 
the auto-correlation functions on large scales, but it falls 
slightly below the geometric mean for r p < XhT 1 Mpc. 
Adopting fine color bins, we find a continuous trend of 
clustering with color: the bluest galaxies have a shal- 
low, low-amplitude correlation function, the clustering 



of "green valley" galaxies is intermediate between that 
of blue and red galaxies, and the reddest galaxies have a 
(slightly) steeper correlation function than galaxies that 
trace the ridge of the red sequence. We present detailed 
results for the —20 < M r < —19 luminosity bin, but we 
find similar trends in other bins where our statistics are 
good enough to measure them. 

The luminosity dependence of clustering for the red 
and blue populations is strikingly different. For blue 
galaxies, the amplitude of w p (r p ) increases slowly but 
steadily with luminosity over the range M r — —18 to 
M r = —22, with nearly constant shape. For red galax- 
ies, there are only weak luminosity trends over the range 
-22 < M r < -19. The -23 < M r < -22 galaxies 
have a much higher correlation amplitude and a strong 
break in w p (r p ) at r p sa 2h~ Y Mpc. Most remarkably, 
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the small-scale (r p < 2hT x Mpc) correlation function of 
the —19 < M r < —18 red galaxies is equal to that of 
the —23 < M r < —22 red galaxies, a factor of 2 — 3 
higher than that of intermediate luminosity red galaxies. 
Red galaxies with — 18 < M r < — 17 show even stronger 
small-scale clustering, though our survey volume for such 
low luminosity systems is small. 

Our HOD modeling shows that these varied trends in 
the amplitude and shape of w p (r p ) can, for the most 
part, be well explained by the combination of ACDM 
cosmology and physically plausible recipes for the rela- 
tion between galaxies and dark matter halos. The lu- 
minosity dependence of w p (r p ) arises from an overall 
shift in the mass scale of the mean occupation function 
(N(Mh)). The halo mass M m ; n for hosting central galax- 
ies of luminosity L rises with luminosity. Correspond- 
ingly, the central galaxy luminosity increases with halo 
mass as L/L* = A(M min /M 4 ) QM exp(-M t /M min + 1), 
where A = 0.32, M t = 3.08 x 10 11 /i~ 1 M Q , a M = 0.264, 
and -Mmin is the halo mass at which the median lumi- 
nosity of central galaxies is L. The mass Mi at which 
halos host an average of one satellite above luminosity 
L follows a similar trend: we find M\ w 17M m ; n over 
most of our luminosity range, with a smaller factor at the 
highest luminosities. We find substantial scatter (rj 0.3 
dex) between halo mass and central galaxy luminosity for 
L > L*, while fits for lower luminosities are consistent 
with little or no scatter. 

The color dependence of w p (r p ) at fixed luminosity can 
be explained well by a change in the relative fractions of 
central and satellite galaxies. In our best-fit models of 
the —20 < M r < —19 bin, for example, the satellite 
fraction rises steadily from 15% for the bluest galaxies to 
75% for the reddest galaxies. Increasing the satellite frac- 
tion increases the large-scale bias factor by placing more 
galaxies in high mass halos, and it produces a steeper 
correlation function with a stronger inflection by boost- 
ing the one-halo term relative to the two-halo term. A 
modest offset in the halo mass scale for central red and 
blue galaxies is physically plausible, but our models are 
able to fit the main observed trends without such off- 
sets, and it is unlikely that the central-galaxy mass scale 
can be the primary driver of the observed color trends in 
w p {r p ). 

Differences in satellite fractions largely explain the dif- 
ferent luminosity dependence of w p (r p ) for red and blue 
galaxies. However, within our standard parameterization 
we are unable to find a statistically acceptable fit to the 
clustering of the red — 19 < M r < — 18 galaxies. After 
adjusting the model to allow red satellites only in rel- 
atively high mass halos (M/j > lO 13 /i _1 Af0, a factor of 
100 above Af m i n ), we do find a statistically acceptable fit, 
but even this model underpredicts the one-halo term of 
w p {r p ) while overpredicting the two- halo term. The dif- 
ficulty in reproducing w p (r p ) for faint red galaxies could 
signify a breakdown of our assumption that the HOD is 
independent of large-scale environment, but the obvious 
forms of environment dependence (redder central galax- 
ies in older halos) go in the wrong direction. Clearly 
the clustering of the faint red galaxy population merits 
further study. While we do not know of any planned red- 
shift surveys that will provide better statistics for such 
low luminosity galaxies, cross-correlation of photometric 
samples with redshift samples of more luminous galax- 



ies may allow higher precision clustering measurements 
from a larger effective volume. 

Our measured luminosity and color trends agree 
with those found in earlie r studies, most notably the 
iNorberg et all (|200lL I2002T ) studies of the 2dFGRS and 
the Z05 study of SDSS DR2, but the SDSS DR7 sam- 
ple allows measurements of higher precision, greater 
detail, and wider dynamic range. Our conclusions 
about the luminosity and color dependence of galaxy 
halo occupations are gene rally consistent with those 
found in earlier studies ( e. g.. Ivan den Bosch et alj|2003at 
iCollister fc Lahavl [20051 : lYang et al.1 l2005bt Z05V al- 
though the greater precision and dynamic range of our 
clustering measurements allows us to examine this de- 
pendence in substantially greater detail. Even with 
SDSS DR7, the tests in £13.21 reveal significant finite- 
volume effects for samples with limiting absolute magni- 
tude M r s=s — 20, which extend just far enough to enclose 
the Sloan Great Wall, and for samples with limiting mag- 
nitude M r > —18, which have small total volume. These 
effects have a significant influence on the (ro, 7) values of 
power-law fits to these samples (and their color-defined 
subsamples) . They have little impact on the best- fit val- 
ues of HOD parameters, though they do affect the \ 2 
values of HOD fits. The finite- volume uncertainties limit 
the strength of our conclusions about the faint red galaxy 
population. 

Our modeling in this paper derives HOD parameters 
for well-specified classes of galaxies defined by thresh- 
olds or bins in luminosity and divisions in color. The 
related formalism of conditional luminosity functions 
(|Yang. Mo. fc van den Bosch! 120031 ) seeks to provide a 
continuous description of the dependence of the galaxy 
luminosity function on halo mass. In a subsequent paper 
(Zheng et al., in preparation), we will present a general- 
ization of this approach to luminosity-color distributions 
and apply it to our w p {r p ) data, resulting in a compre- 
hensive model that synthesizes the information from all 
of the measurements presented here. 

Our HOD parameterization is flexible enough to de- 
scribe the predictions of galaxy formation models accu- 
rately (|Zheng et al.ll2005l ). and with an assumed cosmo- 
logical model the w p {r p ) measurements are themselves 
sufficient to provide tight constraints on HOD param- 
eters. Studies of other real-space clustering measures, 
such as the multiplicity function of groups, the three- 
point correlation function, the topology of isodensity sur- 
faces, and the void probability function can test the HOD 
models presented here, perhaps revealing breakdowns of 
this parameterization that would point to new aspects of 
galaxy formation physics. Most interesting would be to 
find evidence for environmental variations of the HOD, 
as this would tie observable galaxy properties to features 
of halo formation history that correlate with large-scale 
environment at fixed halo ma ss. Conversely, limits on 
environmental var iations (e.g.. iBlanton fc Berlindl 120071 : 
iTinker et alJl2008l ) limit the degree to which galaxy prop- 
erties can be driven by quantities such as halo formation 
time or concentration. 

Uncertainties in cosmological parameters within the 
range allowed by other data have little impact on 
our conclusions. The largest effect is that changes 
to &7 m or as would shift the mass scale of the HOD 
(|Zheng &: Weinberg] l2007f ). The combination of w p (r p ) 
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constraints with dynamical measures that are sensitive 
to the halo mass scale allows novel constraints on these 
cosmological parameters. Efforts in this direction are un- 
derway, using cluster mass-to-light ratios, galaxy-galaxy 
lensing, and redshift-space distortions. We expect these 
an alyses, toge t her w ith the cluster abundance analysis 
of iRozo et al.l (|2010t ). to yield tight independent con- 
straints on erg and VL m with several systematic cross- 
checks. These constraints, based on the inferred am- 
plitude of dark matter clustering, are complementary to 
those derived fr om the large-sca le shape of the galaxy 
power spectrum (jReid et al.ll2010| ) , which can themselves 
be sharpened by using HOD modeling to account for th e 
effects of scale-dependent galaxy bias (|Yoo et al.ll2009l ). 
The combination of these constraints with those derived 
from CMB data, Type la supernovae, baryon acoustic 
oscillations, and other cosmological observables will al- 
low stringent consistency tests of the ACDM cosmologi- 
cal model, at the few-percent level. Surveys of the next 
decade will extend many of these techniques to higher 
redshifts, but the SDSS maps of structure in the present- 
day universe still have much to teach us about galaxy 
formation and the physics of the cosmos. 
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APPENDIX 

A. THE CROSS-CORRELATION FUNCTION OF GALAXIES 

A.l The Relation Between the Cross-correlation and Auto- correlation Functions of Galaxy Samples 

IZu et all (|2008f ) consider the general case of the relation between the two-point auto-correlation functions of a galaxy 
sample and the auto- and cross-correlation functions of its subsamples (see their Appendix). Here we focus on the 
specific case of the correlation functions of red, blue and all galaxies. The point we make here is that only three 
of the four correlation functions (blue-blue, red-red, all-all auto-correlation functions, and red-blue cross-correlation 
functions) are independent. That is, if we measured blue-blue, red-red, and all-all auto-correlation functions, there 
would be no new information from the red-blue cross-correlation functions. 

To demonstrate that this is the case, we recall that the two-point correlation function represents a galaxy pair count. 
The total number of pairs of all galaxies in the parent sample is simply the sum of the numbers of red galaxy pairs, 
blue galaxy pairs, and red-blue galaxy pairs. That is, 



'all 



il + Call) 



'bll 



.(!+£ 



bll 



= ) + ^ n red(l + Cred) + ™bluc«Tcd(l + £c 



.), 



(Al) 



where n a \\, n rc( j, and nbi uc are the mean number density of the parent sample and the red/blue subsamples, £ a n, £biuej 
and £ rc( j are the two-point auto-correlation functions, and £ cross is the red-blue two-point cross-correlation function. The 
factor of 1/2 in front of the auto-correlation terms is to avoid the double count of auto-pairs. Since n a \\ = n IC & + n-biuc, 
the above identity reduces to 
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The same relation holds for projected correlation functions w p . Thus the red- blue cross-correlation function can be 
derived from the three auto-correlation functions. 

As a test of this relation, we predicted the red-blue cross-correlation function based on the measured all-all, red-red, 
and blue-blue auto-correlation functions for the —20 < M r < — 19 volume-limited galaxy sample. The prediction 
agrees essentially perfectly with the measured cross-correlation function shown in Figure 1151 with deviations much 
smaller than the la error bars. 
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There is thus, in theory, no new information provided by the cross-correlation function, when one has measured 
the three individual auto-correlation functions. In practice, the relation in equation (|A2[) and that for projected two- 
point correlation functions can be used for a consistency check. Furthermore, for understanding the mixture among 
different galaxy populations, the cross-correlation function is more readily interpreted than the consistency relation 
itself. For example, segregation of "red" and "blue" halos would produce a distinctive suppression of the one-halo term 
of the cross-correlation function, while its effect on the auto-correlation functions (boosting the red and blue auto- 
correlations relative to the all auto-correlation) might be difficult to disentangle from changes in satellite occupation 
slopes, concentration parameters, and so forth. 

A. 2 The Relation Between the Cross-correlation Function and the Geometric Mean of the Auto- correlation Functions 

The two-point cross-correlation function of two populations (e.g., red and blue galaxies we study here) is often 
compared to the geometric mean of the two-point auto-correlation functions to infer the information about the mixing 
of the the two populations. On large scales, where the two-halo term dominates the correlation functions, the cross- 
correlation function is guaranteed to be the geometric mean of the auto-correlation functions. 17 On small scales, where 
the one-halo term dominates, it is not obvious what we can infer if there are deviations of the cross-correlation function 
from the geometric mean. We show here that the situation becomes even less clear for projected correlation functions. 

As an example, consider the two-point auto-correlation functions of red and blue galaxies, £ re d and £bluej and their 
cross-correlation function £ CTOSS . Under the assumption that these correlation functions are positive, we have 

( / y£rcd(r p ,Tr)£,b\uc{r p ,n)dir J < / £, Icd (r p ,n)dir / £blue(r p , ir)djT (A3) 

from the Cauchy-Schwartz inequality. Even if we had £ cross = \/£rcd£biuc on au scales, the above inequality would 
mean that the projected correlation functions satisfy 

^p, cross _^ y ^p,rcd^p,bluc- v-'"/ 

The equality only holds for the case where both galaxy populations trace the same dark matter distribution and £ rc d 
and £biue are parallel to each other, which is true on large scales but not on small scales. 

B. ROBUSTNESS OF MEASUREMENTS 

Measuring projected correlation functions from galaxy redshift data involves a set of standard proced ures. We test 
here the sensitivity of our measurements to some key details of these procedures (described in §[2|. Figure [B22"l presents 
projected correlation functions for several variants of our standard technique, calculated for the M r < —21 sample, 
which yields the smallest statistical errors among our samples. The overall visual impression of the plotted projected 
correlation functions (top panels) demonstrates the robustness of the measurements, but we examine it in more detail 
by looking at the fractional deviations from our stand ard case (bottom pane ls). 

The measurements presented in the paper all use the lLandv fe Szal av (1993 ) estimator, th e standard for such studies, 
and we show here (left- hand side) results when using also the estimators of Hamilton ( 1993) (long-dashed red line) and 
iDavis fc Peebl es (1983) (short-dashed blue line). We find that the alternative estimators provide similar measurements 
to the Landy-Szalay one, with some differences on small scales for the Hamilton estimator and a significant deviation 
of the Davis-Peebles measurement on large scales. Our results are in accord with long-standing claims that the 
Davis- Peebles estimator is more sensitiv e to uncertainties in th e galax y mean density, wi th the differences becoming 
apparent on large scales (Hamilton 1993; Strauss & Willick 1995; Pons-Bordcria ct al. 1999), and that the Landy-Szalay 
estimator has improved shot- noise behavior (lLandv fe Szalavlfl9 93; Sz apudi fc Szalavlll998l ). A detailed examination 
of different clustering estimators is given by iPons-Borderia et al.l (|1999IK 

The angular selection function is carefully calculated in each sector on the sky, with an average completeness of 
0.97. The detailed angular completeness is mimicked in the random samples, and is then used to weight both real and 
random galaxies. The main analyses of the paper imposed a cut on the angular completeness of / comp > 0.5, to avoid 
shot noise from undersampled regions. A more conservative cut of / comp > 0.9 (which eliminates ~ 1400 additional 
galaxies; dot-dashed green line in the left-hand side) results in negligible changes to the measurements. If we drop the 
weighting of both real and random galaxies by the angular completeness, it also makes no difference to the results. 
However, neglecting to address the angular incompleteness altogether (i.e., by not including it in the random catalog 
and not weighting either data or randoms by it; dotted magenta line) results in a couple of percent difference. 

We test additional variants of our standard procedures in the right-hand side of Figure IB22I The dashed blue curve 
shows the effect of integrating w p (r p ) up to 7r max = 80 h' 1 Mpc instead of 60 h^ 1 Mpc. Replacing the random catalog 
with another realization of equal size makes negligible difference except at the smallest separations; the slight changes 
in the innermost bins show the importance of using a sufficiently large random catalog. Changing the assumed il m 
from 0.3 to 0.25 when calculating comoving separations produces a small (~ 2%) systematic shift of the measurements. 
Finally, relaxing the bright flux limit of 14.5, needed for defining the luminosity-threshold samples, might introduce 
occasional problems with galaxy deblending or saturation nearby (z < 0.05), but increasing the minimum redshift 
from 0.02 to 0.05 results in negligible differences. 

17 This statement relies on the fact that the host halo populations are tracing the same dark matter distribution. One can construct a 
physically absurd but mathematically acceptable model with zero cross-correlation by superposing the halo populations of two independent 
N-body simulations in a single cube, populating one with red galaxies and the other with blue. 
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Fig. B22. — Robustness of the w p measurements to different uncertainties. Top panels show projected correlation functions for different 
variants of our standard procedures (see text) calculated for the M r < —21 sample. Points with errorbars represent our standard measure- 
ments for this sample. Bottom panels show the fractional deviations of these variants compared to our standard case. For clarity, errorbars 
are shown only for the latter, and are comparable for all other cases. 

Table [B6l presents power-law fits to the projected correlation functions for each variant. In all of these cases, we find 
an overall change in the best-fitting power- law parameters of at most 1%. smaller than or compa rable to the statistical 
errorbars, highlighting the robustness of our results. While adopting the lDavis fc Peebles! (|1983l) estimator or entirely 
dropping angular completeness corrections makes a (marginally) noticeable change in parameter values, these choices 
would clearly not be optimal; the smaller changes associated with the [Hamilton! ([19930 estimator or with raising the 
completeness threshold are better indicators of the associated systematic uncertainty. 

C. CORRELATION FUNCTION MEASUREMENTS 

The following tables present the projected correlation function values that are used in this work, together with the 
diagonal error bars on the measurements. Table [C7l and Table [C8l present the measurements from the volume-limited 
luminosity samples described in § [3] The tables include the projected correlation functions measured for the samples 
defined by magnitude bin and thresholds, respectively. Table IC9I and Table IC10I present the measurements for the 
blue and red subsamples, respectively, analyzed in § 2) The full error covariance matrices, obtained from the jackknifc 
resampling, are available upon request. 
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TABLE B6 

Variants of the Standard Measurement for 

the Mr < —21 Sample 



Sample 


TO 


7 


x 2 

dof 


Standard 


5.98 ± 0.12 


1.96 ± 0.02 


6.1 


Davis — Peebles 


6.06 ± 0.11 


1.94 ± 0.01 


5.9 


Hamilton 


6.02 ± 0.10 


1.94 ± 0.01 


5.1 


/comp 


6.03 ± 0.11 


1.95 ± 0.02 


5.8 


No ang. comp. 


5.92 ± 0.12 


1.97 ± 0.02 


6.1 


TTmax = 80 


6.06 ± 0.13 


1.95 ± 0.02 


6.0 


Random 


6.01 ± 0.12 


1.95 ± 0.02 


6.1 


n m = 0.25 


5.97 ± 0.10 


1.97 ± 0.01 


6.1 


Zmin = 0.05 


6.01 ± 0.10 


1.95 ± 0.01 


6.5 



Note. -— rg and 7 are obtained from fitting a 
power-law to w p (r p ) for r p < 20 h~ 1 Mpc, using the 
full error covariancc matrices. 



TABLE C7 
Projected Correlation Function Measurements of Magnitude Bins Samples 



r p 


-23 - -22 


-22 - -21 


-21 - -20 


-20 - -19 


-19 --18 


-18 --17 


0.17 


2307 (510) 


536.1 (25.4) 


297.9 (9.9) 


269.0 (18.3) 


268.4 (46.7) 


211.6 (64.7) 


0.27 


1200 (208) 


359.0 (12.4) 


231.5 (7.8) 


208.7 (16.3) 


212.1 (37.0) 


203.9 (67.3) 


0.42 


713.4 (100.0) 


238.6 (6.5) 


166.4 (6.6) 


152.2 (13.1) 


153.7 (31.0) 


158.1 (58.7) 


0.67 


527.2 (62.0) 


148.7 (4.8) 


117.2 (5.4) 


108.8 (11.2) 


109.8 (22.1) 


114.4 (46.8) 


1.1 


274.1 (25.6) 


99.1 (3.2) 


79.3 (4.3) 


73.1 (7.9) 


75.1 (18.0) 


79.4 (34.1) 


1.7 


155.6 (15.4) 


65.6 (2.2) 


55.8 (3.6) 


48.5 (6.8) 


47.1 (12.0) 


53.0 (20.8) 


2.7 


109.7 (10.7) 


47.2 (2.0) 


40.4 (3.3) 


33.8 (5.2) 


31.4 (7.6) 


36.7 (13.9) 


1.2 


92.0 (5.9) 


34.0 (1.8) 


29.3 (3.1) 


24.2 (4.0) 


18.0 (4.0) 


21.7 (9.9) 


6.7 


56.1 (3.8) 


23.6 (1.5) 


20.9 (2.7) 


16.2 (3.3) 


9.43 (2.50) 


12.9 (6.2) 


10.6 


33.2 (3.1) 


14.9 (1.3) 


13.9 (2.0) 


9.94 (2.03) 


5.95 (1.68) 


2.43 (3.10) 


16.9 


19.4 (2.3) 


8.30 (0.92) 


7.87 (1.32) 


5.00 (1.53) 


3.73 (1.31) 


0.23 (3.04) 


26.8 


10.3 (1.7) 


4.08 (0.72) 


4.59 (1.09) 


2.47 (1.68) 


1.82 (1.11) 


-0.93 (1.71) 


42.3 


5.21 (1.26) 


2.68 (0.54) 


3.40 (0.85) 


1.11 (1.28) 


0.23 (1.15) 


-4.89 (1.85) 



Note. — The first column provides the pair-weighted projected separation of the bin. 
Subsequent columns provide the projected correlation function values, w p (r p ), for the volume- 
limited samples corresponding to the specified absolute magnitude M r bins. The diagonal 
terms of the error covariance matrices are given in parentheses. For reasonable power-law 
interpolations, the pair-weighted projected separations vary by at most 1% from the r p for 
which w p (r p ) equals the pair- weighted average in the bin (see § 12.21 1. 
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TABLE C8 
Projected Correlation Function Measurements of Magnitude Threshold Samples 

r p -22.0 -21.5 -21.0 -20.5 -20.0 -19.5 -19.0 -18.5 -18.0 

0.17 2615 (491) 1028 (68) 586.2 (19.5) 455.7 (11.3) 366.1 (9.3) 307.0 (9.2) 322.5 (17.0) 313.3 (25.9) 294.3 (34.7) 
0.27 1189 (202) 731.7 (34.0) 402.9 (11.7) 296.9 (6.9) 264.3 (7.6) 228.5 (8.3) 231.1 (15.3) 230.2 (24.9) 221.5 (32.1) 



0.42 


728.0 (96.3) 


392.6 (17.1) 


258.7 (6.7) 


197.0 (5.1) 


184.0 (6.6) 


159.3 (7.2) 


162.4 (12.8) 


165.4 (21.1) 


161.4 (27.6) 


0.67 


491.4 (55.3) 


228.6 (10.9) 


163.2 (4.7) 


134.1 (4.1) 


128.6 (5.5) 


110.4 (5.6) 


114.6 (10.3) 


118.3 (17.5) 


114.7 (22.0) 


1.1 


272.8 (23.2) 


144.6 (6.4) 


105.5 (3.0) 


89.4 (3.3) 


84.7 (4.3) 


72.9 (4.2) 


75.5 (7.7) 


79.7 (13.2) 


75.5 (16.5) 


1.7 


154.4 (14.5) 


94.3 (3.7) 


68.9 (2.2) 


61.1 (2.6) 


59.4 (3.6) 


49.8 (3.4) 


50.6 (6.0) 


53.8 (10.5) 


48.6 (11.5) 


2.7 


111.5 (10.4) 


70.5 (2.7) 


50.2 (2.1) 


44.0 (2.3) 


42.9 (3.3) 


34.6 (2.9) 


35.0 (4.7) 


37.4 (7.8) 


32.4 (7.7) 


1.2 


94.5 (5.6) 


48.6 (2.3) 


35.5 (1.8) 


31.2 (2.0) 


30.9 (3.1) 


24.6 (2.5) 


24.2 (3.6) 


25.9 (5.8) 


19.7 (4.4) 


6.7 


56.8 (3.8) 


33.1 (1.8) 


24.5 (1.6) 


21.3 (1.8) 


21.9 (2.7) 


16.7 (2.4) 


15.3 (2.9) 


17.4 (4.5) 


10.8 (2.8) 


10.6 


35.1 (3.2) 


20.9 (1.5) 


15.3 (1.3) 


13.7 (1.5) 


14.6 (2.1) 


10.7 (1.9) 


9.20 (1.78) 


10.6 (2.6) 


6.35 (1.93) 


16.9 


22.0 (2.2) 


11.6 (1.2) 


8.54 (0.94) 


7.65 (1.07) 


8.24 (1.32) 


5.73 (1.28) 


4.11 (1.29) 


5.31 (1.42) 


3.62 (1.34) 


26.8 


11.4 (1.6) 


6.04 (0.95) 


4.11 (0.71) 


4.09 (0.88) 


4.88 (1.06) 


2.82 (1.13) 


1.81 (1.39) 


3.56 (1.76) 


2.14 (1.23) 


42.3 


5.89 (1.21) 


3.28 (0.64) 


2.73 (0.54) 


3.21 (0.70) 


3.58 (0.85) 


1.39 (0.91) 


0.72 (1.24) 


0.96 (1.02) 


0.56 (1.26) 



Note. — The first column provides the pair-weighted projected separation of the bin. Subsequent columns provide the projected 
correlation function values, w p (r p ), for the volume-limited samples corresponding to the specified absolute magnitude M™ ax thresholds. 
The diagonal terms of the error covariance matrices are given in parentheses. 



TABLE C9 

Projected Correlation Function Measurements of Blue Galaxy Samples 

Corresponding to Magnitude Bins 



r p 


-23 - -22 


-22 - -21 


-21 - -20 


-20 --19 


-19 --18 


-18 --17 


0.17 




273.8 (40.0) 


131.7 (8.0) 


108.4 (8.5) 


87.6 (8.5) 


59.2 (10.8) 


0.27 




160.2 (19.6) 


101.1 (5.4) 


89.7 (6.9) 


72.1 (6.6) 


78.2 (11.7) 


0.42 




132.5 (11.6) 


80.3 (3.9) 


64.8 (4.4) 


52.1 (6.9) 


60.9 (10.1) 


0.67 


111.6 (350.0) 


77.7 (7.0) 


58.1 (3.0) 


48.8 (4.1) 


44.9 (4.7) 


46.2 (9.6) 


1.1 


164.5 (129.8) 


66.4 (4.4) 


45.3 (2.3) 


37.9 (2.8) 


34.1 (4.8) 


40.3 (8.1) 


1.7 


55.6 (62.6) 


47.0 (2.5) 


37.2 (2.0) 


27.9 (2.9) 


24.6 (3.6) 


31.5 (8.5) 


2.7 


20.9 (36.9) 


28.7 (1.9) 


27.9 (1.8) 


20.1 (2.5) 


18.0 (2.9) 


25.2 (8.5) 


4.2 


98.9 (32.8) 


23.8 (1.6) 


20.7 (1.7) 


16.1 (2.2) 


11.8 (2.0) 


17.9 (7.5) 


6.7 


56.9 (16.8) 


15.6 (1.1) 


14.5 (1.6) 


11.1 (1.9) 


7.26 (1.74) 


10.6 (4.7) 


10.6 


32.7 (13.8) 


10.6 (1.0) 


10.4 (1.4) 


7.17 (1.40) 


4.27 (1.20) 


1.95 (2.50) 


16.9 


25.6 (8.4) 


6.08 (0.79) 


6.33 (1.09) 


3.75 (1.15) 


2.87 (0.93) 


-1.01 (2.12) 


26.8 


15.7 (7.1) 


3.41 (0.61) 


3.77 (0.84) 


1.57 (0.96) 


1.26 (0.70) 


-0.68 (1.27) 


42.3 


13.6 (4.0) 


2.07 (0.51) 


2.95 (0.68) 


0.33 (0.71) 


0.15 (0.72) 


-3.64 (1.15) 



Note. — The first column provides the pair-weighted projected separation of the bin. 
The subsequent columns provide the projected correlation function values, w p (r p ), for the 
blue galaxy samples corresponding to the specified absolute magnitude M r bins. The diag- 
onal terms of the error covariance matrices are given in parentheses. 



TABLE C10 
Projected Correlation Function Measurements of Red Galaxy Samples Corresponding 

to Magnitude Bins 



r p 


-23 - -22 


-22 - -21 


-21 - -20 


-20 - -19 


-19 --18 


-18 --17 


0.17 


3158 (1061) 


821.7 (45.5) 


570.9 (24.4) 


724.0 (63.0) 


1623 (311) 


3182 (1439) 


0.27 


1300 (268) 


542.0 (24.1) 


433.1 (17.5) 


570.7 (51.3) 


1197 (252) 


2839 (1437) 


0.42 


875.7 (135.5) 


339.6 (12.3) 


305.1 (14.7) 


390.7 (39.3) 


852.6 (203.1) 


2034 (1218) 


0.67 


633.7 (82.3) 


201.7 (7.8) 


206.5 (11.4) 


257.5 (31.9) 


525.5.9 (138.8) 


1329 (909) 


1.1 


350.4 (35.0) 


132.4 (5.1) 


125.8 (8.6) 


157.1 (21.2) 


313.6 (106.9) 


749.7 (592.2) 


1.7 


164.3 (18.4) 


83.4 (3.5) 


82.0 (6.8) 


95.0 (17.3) 


165.2 (61.3) 


385.6 (296.9) 


2.7 


127.3 (13.1) 


60.3 (2.9) 


56.8 (5.5) 


61.3 (12.0) 


91.6 (29.4) 


166.3 (106.1) 


1.2 


107.0 (7.5) 


42.2 (2.6) 


39.6 (4.8) 


39.3 (8.2) 


42.1 (11.5) 


33.4 (22.9) 


6. 


61.3 (4.2) 


29.2 (2.2) 


28.0 (4.1) 


25.1 (6.1) 


14.5 (6.3) 


15.8 (18.5) 


10.6 


35.6 (3.5) 


18.2 (1.7) 


17.8 (2.9) 


14.1 (3.3) 


9.93 (3.82) 


-4.73 (7.73) 


16.9 


20.7 (2.5) 


9.74 (1.21) 


9.30 (1.68) 


6.97 (2.42) 


6.43 (3.32) 


14.5 (15.0) 


26.8 


10.2 (1.8) 


4.68 (0.92) 


5.47 (1.48) 


4.23 (3.20) 


2.99 (3.01) 


-5.02 (7.43) 


42.4 


4.69 (1.18) 


3.22 (0.68) 


3.85 (1.08) 


2.53 (2.40) 


0.29 (3.04) 


-15.6 (9.6) 



Note. — The first column provides the pair-weighted projected separation of the bin. The 
subsequent columns provide the projected correlation function values, w p (r p ), for the red galaxy 
samples corresponding to the specified absolute magnitude M r bins. The diagonal terms of the 
error covariance matrices are given in parentheses. 
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